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Group Art Unit: 1638 

REMARKS ^ 

Reconsideration of the present application is respectfully requested. Claims 
1-7 are pending. Claims 1-4 and 7 have been amended. Claims 75-84 have been 
added. Support for the amendments is found in the claims as originally filed, and 
throughout the specification. No new matter has been added. 

ELECTION 

The election of Group I (claims 1-7) with traverse has been made final. 
Claims 8-74 are non-elected and withdrawn from consideration. Applicants reserve 
the right to pursue these claims in a continuing application. 

Double Patenting Rejection 

Claims 2-7 are rejected under the judicially created doctrine of obviousness- 
type double patenting as being unpatentable over claims 1-10 of U.S. Patent No. 
6,235,972. 

Applicant respectfully disagrees. The current application is a divisional 
application filed in response to the Restriction Requirement of U.S. Application Serial 
No. 09/426,557, now U.S. Patent No. 6,235,972. Claims 2-6 are not obvious in light 
of claims 1-10 of U.S. Patent No. 6,235,972. Claim 1 has been amended and claims 
an isolated polypeptide comprising an amino acid sequence having at least 80% 
sequence identity over the entire length of SEQ ID NO: 2 or 4, wherein the amino 
acid sequence encodes a polypeptide involved in nucleotide excision repair. Claims 
3 and 4 have been amended and now depend from claim 1, therefore claims 3-7 are 
removed from the double patenting rejection. Claim 2 is directed to a recombinant 
expression cassette which expresses the polypeptide of claim 1 Therefore, the claim 
is defined by the polypeptide which was restricted out of U.S. Application Serial No. 
09/413,574 as a separate invention. The claim is consonant with the polypeptide 
claim restricted, therefore it is respectfully requested that the double patenting 
rejection be withdrawn. 
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Rejection under 35 U.S.C S101 - Utility 

Claims 1-7 are rejected under 35 U.S.C §101 because the claimed invention 
is not supported by either a specific and credible asserted utility or a well-established 
utility. The Examiner asserts for claim 1 (a) "It is unclear as to how this sequence 
would be useful, lacking its functional activity". The Examiner further states that 
claim 1 (b)-(d) lack a recitation of function. 

Applicants respectfully disagree, page 2, lines 3-17 of the specification clearly 
details the well-established utility of Rad23 polypeptide involved in DNA nucleotide 
excision repair. In nucleotide excision repair, Rad23 is involved in transcription- 
coupled repair as well as over DNA repair. Rad23 has also been shown to interact 
with a number of proteins involved in DNA repair, transcription, proteolysis and cell 
cycle and has been implicated in ubiquitin mediated proteolysis and cell cycle 
regulation. The present invention proposes to use the well-established utility of 
Rad23 in order to modulate repair activity, and recombination, in order to improve 
gene targeting and transformation in plants (see page 2, lines 13-17) therefore 
establishing specific, substantial and credible utility for the present invention. 

Claim 1 has been amended to delete parts a, b, and d, and further to recite 
80% sequence identity over the entire length of the sequence, wherein the amino 
acid sequence encodes a polypeptide involved in nucleotide excision repair. 
Support for this amendment is found on page 2, lines 3-17, and page 50, lines 7-12 
of the specification. New claims 75-84 have been added. New claims 75-77, which 
depend from claim 1, recite 85, 90, and 95% sequence identity respectively. 
Support for this amendment can be found on page 50, lines 7-12 of the specification. 
New claim 78, which depends on claim 1, recites the polypeptides of SEQ ID NOS: 2 
and 4, and therefore finds support in original claim 1 , part (b). Claim 1, parts (a) and 
(d) have been deleted and are not represented in the current amendment. As 
amended the claims now recite the functional activity. 
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It was well-known at the time of filing that Rad23 homologues are highly 
conserved throughout various eukaryotes from yeast to mammals to plants. The 
highly conserved amino acid motifs were also well known, for example, see Refs A1 , 
A4, A7, A8, and A10 of the IDS submitted 1 1/12/2000 for Serial No. 09/413,574, now 
U.S. Patent 6,235,972. For example, multiple sequence alignments are presented in 
Appendix A, and further sequence analysis is presented in Appendix B. These 
analyses show the overall homology and conserved domains shared by the claimed 
sequences and other Rad23 sequences. Further, these analyses could be used to 
predict protein regions likely to be tolerant of amino acid substitutions and also 
indicate likely acceptable substitutions. 

Appendix A presents two multiple sequence alignments performed using the 
PileUp program in the GCG sequence analysis suite. The first alignment shows 
SEQ ID NO: 2 aligned with Rad23 homologues from yeast (S. cerevisiae), human, 
rice and carrot. The second alignment contains SEQ ID NO: 4 aligned with the 
same sequences. The alignments indicate homology to Rad23 sequences across 
the entire length of SEQ ID NO: 2 and 4, and that SEQ ID NO: 2 and 4 show a high 
level of conservation with the ubiquitin-like (ubi), PAST, and ubiquitin-associated 
(uba) motifs of the Rad23 family, as seen by previous workers (for example, see 
Refs A1 , A4, A7, A8, and A10 of the IDS submitted 1 1/12/00 for Serial No. 
09/413,574, now U.S. Patent 6,235.972). 

Appendix B presents sequence analyses of SEQ ID NO: 2 and SEQ ID NO: 4 
performed by the BioScout suite of programs and includes the summary report, the 
3D structure alignment to the UBA(2) domain of human Rad23A (HHR23A) crystal 
structure PDB 1F4I (Withers-Ward ef a/., 2000 Biochemistry 39:14103-14112, 
enclosed, see also NMR structure Dieckmann etaL 1998 Nat. Struct. Biol. 5:1042- 
1047, enclosed), Pfam analyses results which indicate the presence of three Rad23 
Pfam domains (SEQ ID NO: 2 e values = 7.8e -23, 9.5 e -12, and 4.5 e-06; SEQ ID 
NO: 4 e values = 2e -09, 2.7 e -1 1 , and 2.3e -23), SEQ ID NO: 2 also shows an 
uncharacterized domain (e value 7.2), the alignments of the SEQ ID NO: 2 and 4 
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with the Pfam domains, and descriptions of the Pfam domains. Therefore, Applicant 
has shown the claimed sequences comprise extensive homology and conserved 
functional domains with known Rad23 sequences. 

Support for polypeptides having a given percent sequence identity is taught in 
the specification as originally filed, for example, in SEQ ID NOS: 1-4; guidance on 
codon preferences (page 8, lines 5-25, and page 64, line 10 - page 65, line 6); 
codon degeneracy (page 6, line 23 - page 7, line 13, and page 27, line 31 - page 
28, line 3); sequence comparisons (page 19, line 3 - page 24, line 24, and page 66, 
line 1 - page 69, line 3); conservative substitutions and variants (page 6, line 28 - 
page 8, line 4); polypeptide expression (page 42, line 15 - page 48, line 6, page 51, 
line 4 - page 54, line 16, and page 56, line 31 - page 57, line 12), isolation (page 
57, line 14 - page 58, line 2) as well as the Examples. 

Assays for Rad23 activity were known in the art at the time of filing. For 
example, nucleotide excision repair assays (e.g., Mueller & Smerdon 1996, Ref. A3, 
Wang etaL 1997, Ref. A4, and Sugasawa etaL 1997, Ref. A6, in IDS submitted 
11/12/99 for Serial No. 09/413,574, now U.S. Patent 6,235, 972), complementation 
of the yeast Rad23 mutants (e.g., Watkins etaL 1993 Ref. A1, Mueller & Smerdon 
1996, Ref. A3, and Sturm & Lienhard 1998 Ref. A10, in IDS submitted 11/12/99 for 
Serial No. 09/413,574, now U.S. Patent 6,235, 972). Also available were yeast two- 
hybrid screens, gene knockouts, UV sensitivity, binding assays, co-purification, gels, 
blots, co-immunoprecipitation, and immunolocalization. Therefore, one of skill in the 
art could readily identify macromolecules having Rad23 activity. 

Applicants believe that the claimed invention has a well-established utility for 
which they have proposed specific, substantial and credible uses. Therefore, it is 
respectfully requested that the rejection of claims 1-7 under 35 U.S.C §101 should 
be withdrawn and not applied to new claims 75-84. 
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Rejections under 35 U.S.C. §112, 1^^ Paragraph, Enablement : 

Claims 1-7 are rejected under 35 U.S.C. §112, first paragraph, because the 
specification does not enable any person skilled in the art to which it pertains, or with 
which it is most nearly connected, to make and/or use the invention commensurate 
in scope with these claims. Specifically, since the claimed invention lacks utility for 
the reasons set forth in the rejection under 35 U.S.C. §101, one skilled in the art 
would not know how to use the invention. 

Applicants believe they have properly addressed the rejection under 35 
U.S.C. §101 by amendment and argument and therefore respectfully request that 
the corresponding rejection of claims 1-7 under 35 U.S.C. §1 12, first paragraph for 
lack of enablement be withdrawn. 

Claims 1-7 are further rejected under 35 U.S.C. §1 12, first paragraph, 
because the specification does not enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and/or use the invention 
commensurate in scope with these claims. 

The Examiner asserts that the specification does not provide enablement for 
an isolated polypeptide comprising at least 20 contiguous amino acids of SEQ ID 
NO: 2 or 4, a polypeptide of SEQ ID NO: 2 or 4, a polypeptide having at least 70% 
sequence identity to SEQ ID NO: 2 or 4, or to a polypeptide encoded by a 
polynucleotide that will hybridize under stringent conditions to SEQ ID NO: 1 or 3. 
The Examiner asserts that the specification only teaches that a RAD23 gene of 
yeast was known and that the Applicant believes that the disclosed sequences are 
of a maize RAD23. The Examiner states that there is no evidence that SEQ ID NO: 
2 or 4 have similar structural features to the yeast RAD23 gene, nor is there 
evidence of functional activity for the claimed polypeptides. The Examiner states 
that the specification fails to provide guidance regarding how one skilled in the art 
would evaluate SEQ ID NO: 2 or 4 for a specific activity. 
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Applicants respectfully disagree. Applicants disclose two polynucleotide 
sequences, each of which encode a full-length polypeptide having overall homology 
to known Rad23 sequences and further containing conserved motifs seen in the 
known Rad23 sequences (see above). It is respectfully submitted that 35 U.S.C. 
§112 does not require a working example. Further, case law does not require that 
an application provide a working example. It is only required that the application 
teach the person skilled in the art how to make and use the invention. The present 
application meets that requirement. 

Similarly, the disclosure of SEQ ID NOS: 1-4, the conserved domains and 
motifs known in the art as shown in Appendix A & B and the IDS filed 1 1/12/99, the 
sequence identity and similarity to other known Rad23 sequences, the guidance on 
sequence analyses, comparison, and identity, the guidance on codon degeneracy, 
silent variants, and preferences, the guidance on conservative amino acid 
substitutions, and the ready availability in the art for assays for Rad23 (all as noted 
above) show the specification coupled with the knowledge in the art enables a 
person in the art to make and use sequences having 80% sequence identity to SEQ 
ID NO: 2 or 4 involved in nucleotide excision repair 

The present invention discloses how to make and use the sequences of the 
invention, as discussed above. The question of experimentation is a matter of 
degree. The fact that some experimentation is necessary does not preclude 
enablement; what is required is the amount of experimentation must not be unduly 
extensive. PPG Inc, v. Guardian Industries Corp. (37 USPQ 1218, 1623, (Fed. Cir. 
1996). The test is not merely quantitative, since a considerable amount of 
experimentation is permissible, if it is merely routine, or if the specification in 
question provides a reasonable amount of guidance with respect to the direction in 
which the experimentation should proceed to enable the determination of how to 
practice a desired embodiment of the invention claimed. Ex parte Jacl<son, 217 
USPQ 804, 807 (1982 PTOBA). 
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The Examiner asserts that sequence homology is not sufficient to predict 
function, citing Doerks etaL (TIG 14(6):248-250 1998), Smith etaL {Nature Biotech, 
15:1222-1223 1997), Brenner {TIG 15(4):132-133 1999), and Bork {TIG 12(10):425- 
327 1996). 

Applicant respectfully disagrees. These references all point to the potential 
accumulation and propagation of errors from automatic computer annotation of high 
throughput sequencing. This kind of sequencing and annotation was frequently a 
one-pass sequencing and annotation by top local alignment hit. These do not state 
that analyses of sequence homology are not predictive of function, these point to the 
problem of automatic annotation based on one top hit in a database. Doerks et ai 
actually try to predict the function of uncharacterized protein families (UPFs) through 
sequence analyses alone (page 248, column 1-3). Smith points out that the 
problems do not lie in the homology search algorithms, but in minor database 
annotation inconsistencies (page 1222, col. 1, 3'^ paragraph - col. 2, 1^* paragraph). 
Brenner's statement that ".. then most homologs must have different molecular and 
cellular functions" is prefaced by "if there are only about a 1000 major superfamilies 
in nature" (emphasis added). Brenner examined the annotations by three groups for 
the complete genome of /W. genitalium, and estimated the minimum error rate of 
annotations to be 8%, therefore the correct annotation rate would be approximately 
92%. Bork states "we wish to point out that sequence databases are the most useful 
tool in sequence analysis and the question should be how can one further improve 
their value" (page 472, col. 1,1®* full paragraph). Taken as a whole, these 
references warn of the potential error of relying on one automated annotation to 
identify a gene homologue, but do not state that homology is NOT indicative of 
function. 

Examiner cites Van de Loo et al. (PNAS 92:6743-6747 1995) as an example 
that gene activity cannot be determined merely by similarity of sequences. 
Examiner states that the reference teaches that a change in only four amino acids 
will convert a desaturase gene to a hydroxylase gene. 
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Applicant respectfully disagrees. It was the similarity of the activities, 
cofactors and substrates that suggested to van de Loo et aL that there might be 
sequence similarity as well (page 6743, col. 1-2). The hydroxylase could not be 
purified by biochemical means, therefore van de Loo etaL used differential 
screening and sequencing to identify seed-specific clones that were similar to 
desaturase. A hydroxylase cDNA clone was identified and confirmed based on its 
similarity to a desaturase. Applicant found no teaching of amino acid changes to 
convert a desaturase to a hydroxylase in this reference. Applicant believes the 
Examiner may be referring to Broun etaL (Science 1998 282:131-133) in which very 
specific limited amino acid changes which resulted in alteration of the experimental 
protein's catalytic activity. Broun discloses modifications to a oleate 12-desaturase 
sequence which resulted in the protein having hydroxylase activity. Broun et al. 
actually note the high sequence similarity between the oleate 12-desaturase and 
oleate hydroxylase and use this to identify seven residues conserved in desaturases 
and to target them for modification and activity (see page 131, column 2 - column 
3). Broun et al. actually use the sequence similarity of the desaturase and the 
hydroxylase to predict which residues to change to alter the activity of the 
desaturase. 

Applicants submit that no more than routine experimentation is required. This 
may be accomplished by the examples and methods within the present application 
and within the technical, scientific, skill in the art. The disclosure of SEQ ID NOS: 1- 
4, the knowledge in the art of other Rad23 sequences and the conserved regions, 
the guidance on conserved motifs and amino acids, and sequence comparisons as 
discussed above, and the ready availability of routine Rad23 assays enabled one of 
skill in the art to make and use the claimed invention. Therefore it is respectfully 
requested that the rejection of claims 1-7 under 35 U.S.C §112, first paragraph be 
withdrawn, and not applied to new claims 75-84. 
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R iections under 35 U.S.C. $112, 1^^ Paragraph, Written Description : 

Claims 1-7 are rejected under 35 U.S.C. §112, first paragraph, as containing 
subject matter not sufficiently described in the specification to indicate the 
inventor(s) had possession of the invention. 

The Examiner asserts that the Applicant has not identified structural features 
necessary to describe features of SEQ ID NO: 2 or 4 which distinguish them from 
any polypeptide, since 30% lack of sequence identity could fall in a region essential 
for protein activity. The Examiner states that a genus is claimed, but that there is no 
description of the structural feature that defines the genus, citing University of 
California v. Eli Lilly, 119 F.3d 1559, 43 USPQ 2d 1398 (Fed. Cir. 1997). The 
Examiner concludes that given the lack of written description in the specification 
regarding the structural and physical characteristics of the claimed compositions one 
skilled in the art would not have been in possession of the genus claimed at the time 
this application was filed. 

The Examiner is reminded that every species encompassed by the claimed 
invention need not be disclosed in the specification to satisfy the written description 
requirement of 35 U.S.C. § 1 12, first paragraph. Utter v. Hiraga, 845 F.2d 993, 6 
USPQ2d 1709 (Fed. Cir. 1988). In fact, the description of a claimed genus can be 
by structure, formula, chemical name, or physical properties. See Ex parte Maizel, 
27 USPQ2d 1662, 1669 (B.P.A.1.1992), (citing Amgen v. Chugai, 927 F.2d 1200, 
1206 (Fed. Cir. 1991)). 

Claim 1 has been amended to recite that the sequence shares 80% sequence 
identity to the sequence of SEQ ID NO: 2 or 4, and further recites that the sequence 
encodes a polypeptide involved in nucleotide excision repair. The recitation of at 
least 80% sequence identity is a very predictable structure of the sequences 
encompassed by the claimed invention. Further, Applicants have provide two 
isolated sequences, SEQ ID NOS: 2 and 4. Contrary to yeast, the presence of two 
Rad23 genes was also seen in other higher eukaryotes such as human, mouse, and 
carrot. (Refs. A4, A6, and A10 in IDS submitted 1 1/12/99 for Serial No. 09/413,574, 
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now U.S. Patent 6,235, 972). Further, Sturm et al. (Ref A10) showed that both 
carrot Rad23 homologues complemented a yeast rad23 mutant. The description of 
a representative number of species does not require the description to be of such 
specificity that It would provide Individual support for each species that the genus 
embraces. 66 Fed. Reg. 1099, 1106 (2000). Satisfactory disclosure of a 
"representative number" depends on whether one of skill in the art would recognize 
that the Applicants were in possession of the necessary common attributes or 
features of the elements possessed by the members of the genus in view of the 
species disclosed. 66 Fed. Reg. 1099, 1 106 (2000). Applicants submit that the 
knowledge and level of skill in the art would allow a person of ordinary skill to 
envision the claimed Invention, i.e., a sequence having at least 80% sequence 
identity to the sequence set forth in SEQ ID NO: 2 or 4. The recitation of a 
predictable structure of at least 80% sequence identity to SEQ ID NO: 2 or 4 is 
sufficient to satisfy the written description requirement. 

In addition, an Applicant may rely upon functional characteristics in the 
description, provided there Is a correlation between the function and structure of the 
claimed invention. Claim 1 recite that the claimed sequences are, or encode, a 
polypeptide involved in nucleotide excision repair thereby providing a functional 
characterization of the sequences claimed In the genus. 

Example 14 of the Revised Interim Written Description Guidelines is directed 
to a generic claim: a protein having at least 95% sequence identity to the sequence 
of SEQ ID N0:3, wherein the sequence catalyzes the reaction A -»• B. The Training 
Materials concludes that the generic claim of Example 14 is sufficiently described 
under § 112, first paragraph, because 1) "the single sequence disclosed in SEQ ID 
N0:3 Is representative of the genus" and 2) the claim recites a limitation requiring 
the compound to catalyze the reaction from A -» B. The Guidelines conclude that 
one of skill In art would recognize that the Applicants were in possession of the 
necessary common attributes possessed by the members of the genus. 
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Following the analysis of Example 14, Applicants submit that claim 1 satisfies 
the written description requirements of § 1 12, first paragraph. Specifically, the 
claims of the present invention encompass sequences having at least 80% 
sequence identity to the sequence of SEQ ID NO: 2 or 4, wherein the claimed 
sequences encode a polypeptide involved in nucleotide excision repair. As in 
Example 14, the specification discloses the amino acid sequences of SEQ ID NOS: 
2 and 4, and the amended claims recite a limitation requiring the sequence to have a 
specific function. 

Consequently, contrary to the Examiner's conclusion, the sequences 
encompassed by the genus of claim 1 are defined by relevant identifying physical 
and chemical properties. In fact, the common attributes or features of the elements 
possessed by the members of the genus is that they are Rad23 sequences and 
share at least 80% sequence identity at the amino acid level to the disclosed 
polypeptide sequences of SEQ ID NOS: 2 or 4. The necessary common features of 
the claimed genus are clear. 

Applicants have disclosed two isolated polynucleotide sequences and the 
encoded Rad23 polypeptides in SEQ ID NOS: 1-4, and have provided further 
guidance on sequence isolation, analysis, and identification, codon degeneracy, 
conserved protein domains and motifs, as well as conservative amino acid 
substitutions. Applicants clearly had possession of sequences having at least 80% 
sequence identity to SEQ ID NO: 2 or 4, therefore the rejection of claims 1-7 under 
35 U.S.C. §112, first paragraph written description should be withdrawn. 

Rejections under 35 U.S.C. S112, 2"^ Paragraph : 

Claims 1-7 are rejected under 35 U.S.C. §112, second paragraph, as being 
indefinite. 

The Examiner asserts that the term "member" renders claim 1 indefinite. 
Also, the Examiner asserts that the terms "selectively hybridize" and "stringent 
hybridization conditions" in claim 1 (d) are unclear. 
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The Applicant respectfully disagrees that recitation of a "nnember selected 
from the group" in the preamble makes claim 1 indefinite since this is common 
Markush group claim format and parts (a) - (d) of the claim clearly recited a 
"polypeptide". However, in order to expedite prosecution, claim 1 has been 
amended to delete parts (a), (c) and (d), and no longer recites "a member". Claim 1 
part (d) has been deleted and not represented Therefore, the rejection of claims 1-7 
under 35 U.S.C. §112, second paragraph is obviated. 

Applicants have properly addressed by argument and amendment the 
grounds for the rejection of claims 1-7 under 35 U.S.C. §1 12, second paragraph and 
respectfully request that the rejection be withdrawn. 



In light of the foregoing remarks and amendments, it is believed that claims 1- 
7, and 75-84 are in condition for allowance. Withdrawal of the outstanding rejections 
and allowance of all of the remaining claims is respectfully requested. 



PIONEER HI-BRED INTERNATIONAL. INC. 

Corporate Intellectual Property 

7100 N.W. 62"** Avenue 

P.O. Box 1000 

Johnston, Iowa 50131-1000 

Phone: (515)270-4192 

Facsimile: (515)334-6883 



CONCLUSION 



Respectfully submitted, 




Virginia Dress 
Agent for Applicant(s) 
Registration No. 48,243 
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Description 



0964rad23.sid2 (Protein) 



0964 rad23 sid2 



Update 



Edit 



Function 



Features 
Summan 



Homologies 



At5g38470/At5g38470 

Direct assignment of functionality by homology to 

tremblnew|AY081835 | AY081835_l 

in region 1 to 403 for overall length of 228 (99% of query, 176% of hit, see the 
ali gnment ). 
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Extracted keywords DNA damage. Nuclear protein. DNA repair 
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3D structure inferred by clear homology from residues 357 to 403 in 1F4I-A 
View alignment 
pdb|lF4I|lF4I-A structure 

Expression of this gene is reported for 
Organ Category other species 

Tissue Classification normal tissue 
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homologous sequences. 
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Homo sapiens, Mus musculus, 
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from residue 357 to 396. Source: [pfamsearch] 
Quality: (E=4.5e-06) 

from residue 244 to 337. Source: [pfamsearch] 
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Quality: (E=9.5e-12) 
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Alignment: 0964rad23.sid2 - pdb|lF4I|lF4I-A 

BLASTP - alignment of 0964rad23.sid2 against pdb|lF4I|lF4I-A 

uv excision repair protein protein rad23 homolog afragment: c-terminal uba domain;Mutant 

• This hit is scoring at : 9e-06 (expectation value) 

• Alignment length (overlap) : 47 

• Identities : 48 % 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

• Database searched : nrdb 

Q: 357 EEREAIQRLEGMGFNRELVLEVFFACNKDEELTANYLLDHGHEFDDQ 4 03 

:E:EAI:RL: . :GF. . .LV: : . : FAC. K:E. L. AN: LL . . .FDD: 
H: 1 QEKEAIERLKALGFEESLVIQAYFACEKNENLAANFLL--SQNFDDE 45 



Legend of Alignment 

: positive score 

score between -2 and 0 
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abstract: The DNA repair protein HHR23A is a highly conserved protein that functions in nucleotide 
excision repair. HHR23A contains two ubiquitin associated domains (UB A) that are conserved in a number 
of proteins with diverse functions involved in ubiquitination, UV excision repair, and signaling pathways 
via protein kinases. The cellular binding partners of UBA domains remain unclear; however, we previously 
found that the HHR23A UBA(2) domain interacts specifically with the HIV-1 Vpr protein. Analysis of 
die low resolution solution structure of HHR23A UBA(2) revealed a hydrophobic loop region of the 
UBA(2) domain that we predicted was the interface for protein/protein interactions. Here we present 
results of in vitro binding studies that demonstrate the requirement of this hydrophobic loop region for 
interaction with human immunodeficiency virus (HIV-1) Vpr. A single point mutation of the Pro at residue 
333 to a Glu totally abolishes the binding of HIV-1 Vpr to UBA(2). High resolution NMR structures of 
the binding deficient UBA(2) mutant P333E as well as of the wild-type UBA(2) domain were determined 
to compare tlie effect of this mutation on the structure. Small but significant differences are obsen^ed 
only locally at the site of the mutation. The biochemical and structural analysis confirms the function of 
the HHR23A UBA(2) GFP-loop as the protein/protein interacting domain. 



HHR23A is one of the human homologues of the yeast 
Rad23 and Rlip23 proteins that functions in nucleotide 
excision repair (7-i). The biochemical function of HHR23A 
is poorly understood. In yeast, rad23 mutants are sensitive 
to UV light {4). Rad23 promotes the assembly of a large 
multiprotein complex involved in nucleotide excision repair 
(5). In Saccharomyces cerevisiae, Rad23 is complexed with 
Rad4. Similarly, both of the human homologues termed 
HHR23A and HHR23B interact widi the human ortholog, 
the Xeroderma pigmentosum group C (XPC) protein and 
promote its activit>' in in vitro DNA repair assays (J, 6-9). 
In vitro studies have shown that the Rad23/Rad4 complex 
preferentially binds to damaged DNA (70, 77). Studies in 
yeast indicate that Rad23 is required for both transcription- 
coupled DNA repair and efficient global genome repair (72). 

All of the Rad23 homologues share a common domain 
structure, including a ubiquitin-like domain at the N-terminus 
and two copies of a highly conserved domain termed the 
ubiquitin associated domain (UBA) located in the middle 
[UBA(l)] and at the C-terminus [UBA(2)] of the protein (6) 
(Figure 1). The N-terminal ubiquitin-like (UBL) domain 
interacts with the 26S proteasome, as does that of HHR23 A 
and B, suggesting a role for Rad23 in protein degradation 

' Tliis work was supported by NIH Grant A143190 to l.S.Y.C. and 
J.F. 

* To whom correspondence should be addressed. E-mail: feigon® 
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Figure 1: Schematic representation of die domain structure of 
HHR23A. The shaded boxes indicate the domains that have been 
identified by sequence analysis and include an N-terminai ubiquitin- 
hke domain and two copies of highly conserved ubiquitin associated 
domain (UBA) which are designated UBA(l) and UBA(2). 

via the ubiquitin- proteasome pathway {13-15). In yeast, 
the UBL domain is required for function (2). UBA domains 
are approximately 45 amino acids long and are found in 
many proteins including those involved in the ubiquitination 
pathway, UV excision repair, and cell signaling (76). The 
C-terminal UBA domain of HHR23A has very recently been 
shown to interact with the human DNA base excision repair 
protein 3-methyladenine DNA glycosylase (MPG) (77). 

We previously found that the human immunodeficiency 
virus (HIV-1) Vpr protein binds specifically to the UBA(2) 
domain of HHR23A (7^?). The Vpr protein is a 96-amino 
acid gene product of HIV- 1 that is incorporated into virions 
(79). Vpr plays a role in several aspects of the viral life cycle 
including nuclear import of the preintegration complex, 
transactivation of transcription, induction of cell cycle arrest 
at the G2/M checkpoint, and subsequent apoptosis (20-24). 
These functions are mediated by distinct domains of Vpr. 
Residues within both the N- and C-termini are implicated 
in mediating cell cycle airest (25, 26), Selection for 
maintenance^of wild-type Vpr occurs in chimpanzees and 
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humans infected with HIV-1. supporting the idea that Vpr 
confers an important selective advantage to the virus (27). 

Vpr-induced cell-cycle airest and apoptosis of HIV-1 
infected cells would have a profound effect on T-cells, the 
primary target of HIV-L contributing to the immune 
deregulation associated with AIDS (2S). HIV-1 infected 
CD4"'" T-cells would fail to undergo activation and clonal 
expansion. Vpr-mediated apoptosis would contribute to 
depletion of the 004"^ cells that is seen in HIV-infected 
individuals. Therefore, Vpr may be an important viral protein 
to target in the development of therapeutic reagents for 
treatment of HIV-1 -infected individuals. In addition to 
finding that Vpr interacts specifically with the UBA(2) 
domain of HHR23A, we previously demonstrated that 
overexpression of HHR23A or a fragment encompassing the 
UBA(2) domain would partially alleviate cell cycle arrest 
mediated by Vpr (IS). This suggests a significant functional 
role for the interactions between Vpr and HHR23A. We 
hypothesized that Vpr may mediate cell cycle an-est and/or 
apoptosis by binding to the UBA(2) domain of HHR23A, 
thereby leading to deregulation of noiTnal cellular processes 
involved in the control and signaling of DNA repair. Thus, 
an understanding of the su-uctural requirements for interaction 
between Vpr and UBA(2) may not only provide insight into 
the role of HHR23A in Vpr function but may also lead to 
strategies for the development of therapeutic reagents that 
would interfere with Vpr function. 

We have previously reported a low resolution structure 
of UBA(2) of HHR23A determined by homonuclear NMR 
spectroscopy (29). The strucmral motif of the UBA(2) 
domain is composed of three a-helices folded around a 
hydrophobic core. Analysis of the NMR structure revealed 
an unusually large hydrophobic area of the protein that we 
predicted to be an interface for protein/protein interactions 
and might be the Vpr-interacting region of the HHR23A 
UBA(2) domain. This hydrophobic area of the UBA(2) 
domain is in the vicinity of the GFP-loop (asjGFPass) which 
forms a sharp turn between a-helices 1 and 2. To further 
characterize the amino acids that are critical for interaction 
with HHR23A UBA(2), synthetic peptides derived from the 
UBA domains of S. cerevisiae (2) and an unrelated protein 
c-cbi {30) that is involved in signal transduction in hemato- 
poietic cells were assayed for binding to Vpr in vitro. The 
amino acid sequence of these UBA domains differs at and 
around the GFP-loop. The results indicate that the Pro within 
the GFP-loop of the HHR23A UBA(2) domain is required 
for interaction with Vpr. Further studies with mutant peptides 
demonstrate that the Pro within the GFP-loop is necessary 
but not sufficient for binding to Vpr. Furthermore, the 
HHR23A UBA(l) domain and the UBA domain from c-cbl 
do not interact with Vpr, suggesting that the interaction 
between the HHR23A UBA(2) domain and Vpr is a highly 
specific one. High-resolution structures of both the wild type 
and the P333E mutant of UBA(2) were determined by NMR 
to address the question of how this mutation affects the struc- 
ture and/or liydrophobic surface of the protein. Comparison 
of wild-type and P333E mutant structures reveals that subsd- 
tution of tlie Pro residue 333 with a Glu results in changes 
in the shape and charge of the hydrophobic patch predicted 
to be the Vpr-interacting domain. These changes can account 
for the loss of the ability to bind Vpr. The structural analysis 
of a Vpr-interacting protein and idendfication of a Vpr-inter- 



acting interface provide a rationale for the future development 
of therapeutic reagents that interfere with Vpr funcdon. 

^L\TERIALS AND METHODS 

Expression of GST Fusion Proteins in Baaerici. The HIV- 1 
Vpr GST fusion protein was expressed and purified as 
described v^ith some modifications (76*). Briefiy. the HIV- 
iNu-3 coding sequence, inserted into the pGEX-KG expres- 
sion plasmid, was expressed in Escherichia coll BL21 (DE3). 
Following disruption of the cells by sonicaiion and centrifu- 
gatibn, the cells were dialyzed overnight at 4 against PBS. 
The dialysate was then applied directly to a prepacked 
glutathione sepharose column (Pharmacia Biotech) and 
purified according to the manufacturer's instructions. Frac- 
tions were collected and analyzed by SDS-polyacrylamide 
gel electrophoresis (PAGE). Proteins were visualized by 
staining the polyacrylamide gels with Coomassie Blue or 
by transl'erring the proteins to nitrocellulose and staining with 
Ponceau S. Tliese assays confirmed that all fractions were 
free of contaminating bacterial proteins. Those fractions 
containing tlie bulk of the GSTVpr fusion protein were 
pooled and dialyzed overnight at 4 °C against PBS. Protein 
concenti-ation was determined by the Bradford method. 
Purified proteins v/ere stored at -70 ^C. 

Synthesis and Preparation of Peptides. Peptides corre- 
sponding to UBA domains were biochemically synthesized 
by Fmoc (fluorenylmethoxycarbonyl)/r£'rr-buiyl based solid- 
phase peptide chemistry method on an ABI 431 A (Perkin- 
Elmer) pepnde synthesizer utilizing a single coupling 
program to carry out the chain assembly. The crude peptide 
was purified to 95% homogeneity by preparative reverse- 
phase HPLC on a Vydac CI 8 column. The composition and 
purity of each peptide was assessed using electro-spray 
ionization (ESI) mass spectrometr}^ and amino acid analysis. 
For detection purposes, all pepddes were synthesized with 
a biotin molecule at their N-termini. The sequences of the 
peptides used in these studies are listed below with mutations 
indicated in boldface type. 

HHR23A UBA(2): QEKEAIERLKALGFPESLVIQAYF- 
ACEKNENLAANFLLSQNFDDE; HHR23A UBA(2)P333E: 
QEKEAIERLKALGFEESLVIQAYFACEKNENLAANF- 
LLSQNFDDE; HHR23A UBA(2)333GFP/ERD33< QEKEAI- 
ERLKALGFERDLVIQAYFACEKNENLAANFLLSQNF- 
DDE;HHR23AUBA(I): SEYETMLTEIMSMGFERERVV- 
AALRASYNNPHRAVEYLLTGIPGS; 5. cerevisiae UBA- 
(2): EDDQAISRLCELGFERDLVIQVYFACDKNEEAA- 
ANILFSDHAD: 5. cerevisiae UBA(2)E370P: EDDQAIS- 
RLCELGFPRDLVIQV^TACDKNEEAAANILFSDHAD; S. 
cerevisiae UBA(2)37oPES372: EDDQATSRLCELGFPESL\TQ- 
VYFACDKNEEAAANILFSDHAD; UBA c-cbl: SPQLSS- 
EIENLMSQGYSYQDIQKALVIAQNNIEMAKNILRE. 

Samples for binding assays were prepared by dissolving 
the purified lyophilized peptide in 10 mM Hepes, pH 6.8 (1 
mg/inL). In some cases, the pH was adjusted to 7.0 by the 
addition of 1 N NaOH. Aliquots were stored at -70 °C and 
thawed just prior to use. 

In Vitro Binding Assay. This assay is a slighdy modified 
version of a binding assay that we have previously developed 
and tested with all of the relevant controls (18). Glutadiione 
Sepharose 4B (Pharmacia Biotech) was prepared by washing 
0.5 mL of the suspension obtained from the manufacturer 
three times in prebinding buffer (100 mM NaCK 25 mM Tris- 
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HCl [pH 7.5], 0.1% Triton X-100) (1 mL/wash) and then 
resuspending it in prebinding buffer in a total volume of 2 
niL. GSTVpr or GST (10 wg/reaction) was added to 100 i^L 
of the glutathione Sephiu'ose suspension. After samples were 
incubated at room temperature (30-60 min), prebound 
complexes were washed once in prebinding buffer (I mL/ 
reaction) and then twice with buffer A (20 mM Tris-HCl, 
pH 7.5, 100 mM NaCl, 5 mM DTT). Peptides were diluted 
4-fold in buffer A, and 50 /,ig was added to each reaction. 
After a 30-nnin incubation at room temperature, samples were 
washed three times in buffer B (20 mM Tris-HCl, pH 7.5, 
300 mM NaCl, 1 mM DTT, 0.5% NP40) (1 niL/wash)' 
Samples were then incubated in blocking buffer [1 x TNT 
(20 mM Tris, 500 mM NaCl, pH 7.5. 0.3% Tween 20 (v/v), 
4.0% nonfat dry milk (w/v), 1.0% BSA (w/v)] (0.2 mL/ 
sample) for 30 min at room temperature, and then washed 
three times in buffer B. After a 30- min incubation in a 
streptavidin horseradish peroxidase conjugate (Amersham 
Life Science Inc.) [diluted 1/5000 in TNT that was 0.2% in 
BSA (w/v)], samples were washed three times in buffer B. 
1-Step Turbo TMB-ELISA solution (Pierce Chemical Co.) 
was then added to each sample (0.25 mL/sample). After a 
lO-min incubation at room temperature, 150 juL of sulfuric 
acid (1.2 N) was added to each sample, and the absorbance 
was read at 450 nm in a spectrophotometer. 

Preparation of the UBA(2) Domain. A DNA fragment 
encoding the UBA(2) domain (residues 319-363) was 
amplified by PCR from full-length HHR23A gene (J 8) and 
inserted into the pGEX-2T vQctor (Phan-nacia) between the 
BamHl and EcoRl restriction sites. £. coli strain BL21(DE3)- 
pLysS was then transformed with the plasmid, and the 
transformed bacteria were cultured in either LB or M9 media. 
To prepare ^^N- or ^^N- and »^C-labeled protein, '^NH4CI or 
[^^Celglucose was used as sole nitrogen or carbon source in 
the growth medium. Expression of GST-UBA(2) fusion 
protein was induced with 1 mM isopropyl-/3-D-thiogalacto- 
pyranoside when the absorbance of the culture reached 
approximately 0.7 optical density units at 550 nm. The 
temperature was lowered from 37 to 30 ''C during the 
expression to increase the yield of soluble fusion protein. 
The cells were harvested 3 h (4.5 h for the preparation of 
labeled protein in M9 medium) after induction and resus- 
pended in 10 mM Tris-HCl, pH 8.0, 1 mM EDTA, 0.375 M 
sucrose (15 mL/L of culture). Cells were lysed by adding 1 
vol of BugBuster (Novagen), 1 mg/mL lysozyme (Sigma), 
and 0.5 U of Benzonase (Novagen). The GST fusion protein 
was cleaved using biotinylated thrombin (Novagen). The 
protease was then removed with streptavidin agarose, and 
the UBA(2) protein was finally purified by an anion- 
exchange chromatography step (high performance Q sepharose, 
Pharmacia). The final purified UBA(2) protein consisted of 
residues 3 1 9-363 of the HHR23A protein and an N-terminal 
Gly-Ser pepdde derived from the expression vector. Typi- 
cally, between 5 (^^N- and ^^C-labeled) and 12.5 mg ('-'N- 
labeled) of purified UB A(2) protein was obtained from 1 L 
of M9 medium. Inidal NMR studies for the wild-type protein 
as well as the NMR analysis of the UBA(2) mutant P333E 
were performed using peptides derived from chemical 
synthesis as previously described {29). 

NMR Analysis and Structure Calculation of the Wild-Type 
UBA(2) Domain and the Mutant P333E. NMR samples of 
wild-type UBA(2) contained between 1.5 and 3 mM protein 
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in 50 mM sodium phosphate, pH 6.5, 100 mM sodium 
chloride, 2 miVI dithiodireitol-Js (DTT), 0.02% sodium azide, 
and 5% -H2O. All spectra were acquired at 300 K with a 
Brulcer DRX 500 or DRX 600 spectrometer. A series of 
tliree-dimensional experiments ('-^N-HSQC-NOE-HSQC. '^N- 
HSQC TOCSY. CBCA(CO)NR CBCANH, HBHA(CO)- 
NH, HBHANH, HCCH-COSY. and HCCH-TOCSY) were 
acquired for the assignments of the 'H. '-^N, and '-^C 
resonances (31). All NMR data was processed using XWTN- 
NMR (Bruker) and analyzed with the software XEASY (32). 
Distance informadon was obtained from '^N- or '^C-edited 
3D NOESY experiments as well as a series of 2D NOESY 
experiments with mixing times ranging from 80'to 300 ms. 
NOE connectivines obtained from 3D NOESY experiments 
were categorized into strong, medium, weak, and very weak, 
corresponding to upper jimits for proton-proton distances 
of 3.0, 4.0, 4.5, and 5.0 A, respecdvely. 2D NOESY spectra 
were analyzed and integrated using the softvv'are packages 
XEASY (32) and SPSCAN (33). No stereospecific assign- 
ments of methylene protons or methyl groups were used. 
Upper limit distance restraints were then generated from the 
NOE volumes using DYANA (34). In" total, 779 NOE- 
derived distance restraints (173 intraresidue, 196 sequential, 
219 medium range, and 191 long range) were used for the 
wild-type UBA(2). In the final stages of refinement, 17 
hydrogen bonds were introduced based on a regular pattern 
of NOEs of the type Hai%+3 and HaiNH,:+3, as well as carbon 
and proton chemical shift information suggesting a regular 
a-hehx. 

The proton resonances of the UBA(2)P333E vv'ere assigned 
by analysis of a series of 2D ^H NMR experiments composed 
of TOCSY, DQF-COS Y, and NOESY. NOE connecdviues 
were derived from a series of 2D NOESY experiments 
acquired with mixing dmes ranging from 100 to 350 ms. 
Distance restraints were generated as described above. A total 
of 826 NOE-derived distance restraints (204 intraresidue, 182 
sequendal, 226 medium range, and 214 long range) were 
used for the structure calculadon of UBA(2)P333E. Although 
almost all NOE-derived distance restraints were derived from 
the 2D NOESY data acquired on the mutant sample, an 
addidonal 148 distance restraints were taken from the wild- 
type UBA(2) 3D data set due to spectral overiap in the 2D 
NOESY spectra of the mutant. All of these were located 
v/ithin the hydrophobic core and were hn from the site of 
mutation. Idendcal proton chemical shifts between the wild- 
type and die mutant protein indicate that the structural 
environment for those protons is the same, and use of those 
restraints does not bias the structure of the mutant protein. 

The structures of wild-type UBA(2) and its mutant P333E 
were calculated from an extended peptide conformation with 
a simulated annealing protocol using XPLOR (35). The 
simulated annealing protocol consisted of 35 ps dynamics 
at 2000 K (integration dme steps of 5 fs) and 35 ps slow 
cooling (10 K/cycle). Final minimizadon was performed 
using 50 steps of Adopted Raphson Newton algoridim 
(Quanta98, MSI). For each protein, 100 structures were 
calculated, of which 18 (wild-type) and 21 (mutant P333E) 
were selected based on lowest total and NOE energy. 

Coordinates for the 18 lowest energy structures of UBA- 
(2) (accession code IDVO) and UBA(2)P333E (accession 
code 1F4I) have been deposited in die RCSB protein data 
bank. 
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Figure 2: Schematic of the in vitro binding assay (18). Binding 
reactions were perfonned in the following manner. Ten micrograms 
of GSTVpr or GST bound to glutathione sepharose was mixed with 
50 f,i» of biotinylaied UBA(2) peptide and incubated at room 
temperature for 1 h. GST-containing complexes were selectively 
recovered and incubated with streptavidin conjugated to horseradish 
peroxidase. The presence of bound biotinylated peptide was 
determined by reading the absorbance at 450 nm after the addition 
of tetramethyl benzidine substrate solution (1-Step Turbo TMB- 
ELISA, Pierce Chemical Co.) and sulfuric acid. 

RESULTS AND DISCUSSION 

Binding of Vpr to UBA Domains Is Specific to the UBA- 
(2) ofHHR23A, Our previous studies showed that Vpr binds 
specifically to UBA(2) of HHR23A and HHR23B, Since the 
UBA domain is a common structural motif, the binding to 
Vpr of a variety of other UBA domains was tested using a 
binding assay based on one that we have previously described 
{18) (Fi gure 2). Binding of GSTVpr-associated proteins was 
detected by ELISA. The amino acid sequences and alignment 
of the peptides that were tested are shown in Figure 3, panel 
A. The results of the in vitio binding assays using peptides 
derived from the internal UBA domain of HHR23A [UBA- 
(1)], 5. cerevisiae Rad23A UBA(2), and die UBA domain 



of an um-elated protein, c-cbl {30). which is involved in signal 
transduction in hematopoeitic cells, are shown in Fieure 3, 
panel B. None of these peptides show binding to GSTVpr 
- in this assay. The 5. cerevisiae Rad23 UBA(2j domain 
contains several hydrophilic residues in the vicinity of the 
GFP-loop. Western blot analysis (29) also did not show any 
detectable binding of 5. cerevisiae Rad23 UBA(2) to 
GSTVpr (unpublished results). The absence of binding to 
the UBA domains of c-cbL S. cerevisiae Rad23, as well as 
the UBA(l) of HHR23A indicates that the interaction 
between the HHR23A UBA(2) domain and Vpr is a highly 
specific one and not simply the result of the promiscuous 
interaction of Vpr with a domain found witliin many proteins 
with diverse cellular functions. Furthermore, these results 
support the prediction that the presence of hydrophilic 
residues at and around the region of the GFP-loop might 
preclude Vpr binding and lend support to the proposal that 
the GFP-loop is the interface for Vpr interaction. 

Identification of Critical Residues on HHR23A for Binding 
to Vpr. As a next step in characterizing the amino acids tha^i 
are critical for interaction with HHR23A UBA(2), we 
performed in vitro binding studies with synthetic mutant 
peptides derived from the UBA(2) domains of 5. cerevisiae 
and HHR23A. A comparison of the amino acid sequence of 
the UBA domains in the vicinity of the GFP-loop indicated 
the presence of several charged residues in the S. cerevisiae 
UBA(2) domain at and around the GFP-loop that might 
contribute to the loss of binding to GSTVpr. In particufar. 
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Figure 3: (A) Amino acid sequences and alignment of UBA domains: HHR23A UBA(2), S. cerevisiae Rad23A UBA(2), c-cbl UBA, and 
HHR23A UBA(n. The amino acids that comprise a-helices 1, 2. and 3 are indicated above the sequence. The sequence of amino acids are 
colored green (hydrophobic), red (negatively charged), orange (polar uncharged), and blue (positively charged) (B) Results of bindinc^ 
assays with HHR23A UBA(2), HHR23A UBA(2) P333E, S. cerevisiae Rad23A UBA(2), c-cbl UBA, knd HHR23A UBA(l) (C) Results 
of binding assays with HHR23A and S. cerevisiae Rad23 UBA(2) mutant peptides. Binding reactions were performed as described in the 
Materials and Methods section. Data shown are values obtained from duplicate samples from a typical experiment. 
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Figure 4: Stereoviews of superpositions of the lowest energy structures of (A) wild-type UBA(2) (18 structures) and (B) UBA(2)P333E 
(21 structures). Backbone atoms for residues 320—357 are shown. The C-terminal residues 358-363 are disordered and are not shown 
here. Side chain atoms for the residues contributing to the hydrophobic core are displayed in green. 



we predicted that the Pro residue at position 333 of HHR23 A 
UB A(2) might be critical for interaction with Vpr. To assess 
the contribution of this residue to the HHR23A UBA(2)/ 
Vpr interaction, a mutant HHR23A UBA(2) peptide contain- 
ing a glutamic acid residue at position 333 [HHR23A 
UBA(2)P333E)] was chemically synthesized and assayed for 
binding. This P333E substitution completely abolished 
binding of the peptide to GSTVpr (Figure 3, panel C). The 
HKR23A UBA(l) domain contains a glutamic acid residue 
at the position corresponding to position 333 in the HHR23 A 
UBA(2) domain, and the c-cbl UB A domain contains a serine 
at that position. These results indicate that the proline at 
position 333 of HHR23A UBA(2) is necessary for interaction 
with GSTVpr. 

Further studies with mutant synthetic UBA(2) peptides 
demonstrate that the Pro at position 333 is critical for binding 



but not sufficient on its own for the interaction of UBA(2) 
with GSTVpr. Neither an E370P substitution nor a 370PES372 
substitution made within die 5. cerevisiae UBA(2) confers 
upon those peptides the ability to bind GSTVpr (Figure 3, 
panel C). These in vitro binding results demonstrate the 
importance of the hydrophobic GFP-loop region for interac- 
tion of UB A(2) with Vpr. 

Refined High-Resolution Structures of Wild-Type UBA- 
(2) of HHR23A. To further explore the role of the critical 
P333 residue on binding of UBA(2) to Vpr, we undertook 
to determine the structure of UBA(2) P333E. During the 
assignment of this mutant protein using 2D proton NMR data 
recorded from a synthetic peptide, we discovered a mismatch 
between the assignments for the wdld-type UBA(2) (29) and 
the mutant. To resolve the conflict, the structure of HHR23 A 
UBA(2) was redetermined using heteronuclear NMR spec- 
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troscopy on a '^C-,'^N-labeled protein. The high resolution 
structure was necessary for the structure comparison, to 
analyze the small structural changes that might be caused 
by die point mutation of Pro333 to Glu. For the preparation 
of labeled protein, the cDNA of UBA(2) encoding die 
residues 319—363 was cloned into the expression vector 
pGEX-2T. Analysis of the isotope-edited 3D data revealed 
a misassignment in the original data due to spectral overlap. 
Specifically. NOEs were incon'ectly assigned between the 
side chain of Leu330 and the aromatic ring protons of 
Tyr341. The correct NOE connectiviries are found to be 
between the side chains of Ile324 and Tyr341 as well as 
Glu34S and Tyr341. For the highly refmed structure of the 
wild-type UBA(2) domain 779 NOE-derived distance re- 
straints (vs 380 in the previously reported structure) were 
obtained. Simulated anneahng calculations were employed 
to generate an ensemble of 18 structures (Figure 4, panel 
A) consistent with the NMR data. The structures exhibit good 
covalent geometry and have no NOE violations greater than 
0.5 A. Figure 4, panel A, shov/s the quality of the calculated 
structures with a root-mean-square (rms) deviation between 
the atomic coordinates of the backbone and all heavy atoms 
with respect to the average coordinates for the residues within 
the three a-helices (residues 320-330, 337-344 and 350- 
357) of only 0.178 ± 0.037 A and 0.686 ± 0.064 A, 
respectively. Complete restraint and structural statistics are 
presented in Table 1. 

In the refined structure, the three helix bundle is more 
compact and die orientadon of the interhehcal angles changed 
slightly (Figure 4, panel A). Interhelical angles were 
determined using the program RIBBONS2.6 (36). Helices 
1 and 2 are arranged at an angle of 120° (equivalent to -60°) 
and helices 2 and 3 exhibit an angle of 126° (or -54°). These 
values are close to the optimum of 52° reported for a "knob 
in hole" pacldng of the side chains. In contrast, the 
interhelital angle between heUces 1 and 3 is only about 30°. 
A tight "knob in hole" packing for the side chains of these 
two helices is not requu-ed due to the larger distance between 
them. The resulting change in the structure of the UBA(2) 
domain is mainly a shift of the first a-helix of the three- 
helical bundle along its axis of about 3 A. The orientation 
of the first helix with respect to the other two helices is only 
slightly affected (about 15°). 

The structure of the UBA(2) domain is well-ordered from 
residues Glu320 to Ser357. In contrast to the low resolution 
structure of UBA(2) (29), a full set of NOEs of the type 
HaiH^i+3 and HaiNHj+3, were found for residues Glu320 to 
Leu330, showing that the first helix is extended to the 
N-terminal end of the UBA(2) domain. The C-terminal 
residues Asn359 to Asp363 show no medium or long range 
NOE connectivities and are disordered. Carbon and proton 
chemical shifts for these residues are similar to those found 
for random coil conformadons. The first loop connecting 
helices 1 and 2 is well-ordered, and the residues Gly331 to 
Leu336 show many NOE connectivities to residues located 
in the hydrophobic core. NOE connectivities for P333 clearly 
indicate that the Pro residue adopts a trans conformation. 
The second loop, consisting of the residues Glu345 to 
Asn349, is far more exposed on the surface. Line broadening 
for several residues in this loop, namely Lys346 and Glu348, 
indicates that this loop is dynamic, likely due to slow 
chemical exchange processes. 



Table 1 : Input Data for Structure Calculation and Structural 
Statistics UBA(2) [18 Structures] and UBA(2)P333E [21 Structures] 



Distance Restraints 


UBA(2) 


UBA(2>P333E 


total 


796 


843 


NOE-derived 






intraresidue 


173 


204 


sequential (|/ — j\ = 1 ) 


196 


1S2 


medium-range (|/ ~ jj < 4) 


219 


226 


long-range (|/ — /j > 4) 


191 


214 


hydrogen bonds 






medium-range 


17 


.17 


(within-helices) 







Structure Statistics 



RMSD from experimental 
distance restraints (A) 

RMSD from idealized geometry 
bonds (A) 
angles (deg) 

energies (kcal mol"') 

RMSD from avg structure (A) 

backbone atoms in 

secondary structure* 

backbone atoms for 

residues 3—9 

all heavy atoms in 

secondary structure* 

all heavy atoms for 

residues 3—39 
Procheck analysis 

residues in most favored 

region (%) 

residues in additional 

allowed region (%) 

residues in generously 

allowed regions (%) 

residues in disallowed 

regions (%) 



0.047 ± 0.001 0.052 ± 0.0003 



0.005 ± 0.00004 0.005 ± 0.00005 
0.616 ± 0.007 0.617 ±0.004 



11.7 ±0.4 
17.1 ±0.3 
74.9 ± 1.6 
-262± 9 

0.18 ±0.04 

0.21 ±0.06 

0.69 ± 0-06 

0.72 ± 0.07 

79.7 
15.9 
3.7 
0.8 



9.8 ± 0.5 
IS.3 ±0-4 
74.8 ± I.O 
-270 ± 6 

0.12 ±0.03 

0.16 ±0.07 

0.66 ± 0.08 

0.71 ±0.07 

76.1 
20.9 
2.6 
0.4 



° Lennart— Jones energy was calculated after the accepted structures 
were subjected to a final minimization (50 steps Adopted Raphson 
Newton algorithm, QU.ANTA98) using the CHARMM force field and 
the NOE-derived distance restraints. * Secondary structure is composed 
of residues 3—12, 19—25. and 32—39. These residues are numbered 
321-330, 337-343, and 350-357 in HHR23A. 

One interesting feature of the UBA(2) domain is the 
presence of large hydrophobic patches at the protein surface 
(Figure 5). Several hydrophobic residues that show contacts 
with the hydrophobic core are partially exposed on the 
surface. The largest area is made up of residues Leu327, 
Ala329, and Leu330 in helix 1, residues Gly331, PheSSli 
Pro333, and Leu336 in the loop 1, and residues Ala35i and 
Leu355 in helix 3, covering a total of about 596 (epitope 
1). This corresponds to approximately 18% of the total 
solvent accessible surface area of 3288 A-. A second 
hydrophobic region (epitope 2) located on the opposite side 
of the UBA(2) molecule consists of the residues Ala323, 
Ile324 (helix 1), Ile338, Phe341, Ala342 (helix2), Leu349 
(helix3), and Phe359 (C-tenninus). The total surface area is 
555 A-, but in contrast to the hydrophobic epitope 1, this 
area is not as compact and the residues do not form a 
consecutive epitope. Such large hydrophobic surface patches 
are rather unusual and as previously noted are likely binding 
sites for other proteins. The site of interaction of HTV-l Vpr 
with the UBA(2) domain was therefore suggested to occur 
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between the hydrophobic region of Vpr and epitope 1. 
Epitope 2 is a potential site for other protein-protein 
interactions in the XPC complex. 

Comparison of the Structures of Wild-Type UBA(2) and 
Mutant P333E. As discussed, a single point mutation of the 
residue Pro333 to a Glu has been shown to completely 
abolish the binding of HIV-1 Vpr to UBA(2). To investigate 
whether the impaired binding is due to a change in structure 
or the change in a critical interaction with the amino acid 
side chain, we determined the structure of the UBA(2) mutant 
P333E (Figure 4, panel B). A synthetic peptide was used 
for the NMR analysis. For the final structure calculations, 
826 NOE-derived distance restraints were used, of which 
204 are intraresidual, 1 82 are sequential, 226 are medium 
range, and 214 are long-range NOE connectivities (Table 
1). Twenty-one of 100 calculated structures were selected 
based on lowest total and NOE energy. Figure 4, panel B, 
shows the high structural precision of 0.121 ± 0.031 A for 
the backbone atoms and 0.661 ± 0.080 A for the heavy 
atoms for the three helices. The high-resolution structures 
for the wild-type and mutant UBA(2) domain were of similar 
resolution enabling us to do a detailed structure/function 
analysis. 

The coordinates for the backbone atoms within the three 
helices of the wild-type UB A(2) and mutant P333E are nearly 
identical, showing that the only structural changes between 
the wild-type and mutant UBA(2) occur locally around the 
mutated residue 333 (Figure 4), Comparison of the proton 
chemical shifts of the wild-type and the mutant resonances 
also indicates that there is no change in the overall structure 
(data not shown). Differences in the proton chemical shift 
larger than 0.1 ppm are observed for the residues close to 



the site of mutation, i.e., for the residues Glv331, Phe332 
Glu334, and Ser335. 

Figure 6 shows the superposition of the structural ensemble 
of wild-type UBA(2) and the mutant P333E for the residues 
Leu327 to Val337 around the GFP-loop. The structures were 
superimposed using the backbone atoms within die three 
helices. Structural changes within loop 1 between wild-type 
and mutant are small but significant. The root- mean-square 
deviation to the mean within loop 1 is about 0.2 A for wild 
type and P333E. However, the difference in the positions of 
the amide nitrogens of residues 333 and 334 are about 1.2 
and 0.8 A respectively. This clearly indicates that the 
sti-uctural changes observed are significant within the ac- 
curacy of the NMR data. The largest change in main chain 
conformation occurs at the site of mutation. The backbone 
torsion angles for the residue Phe332 preceding the mutated 
residue are different for the wild-type and the mutant protein. 
The difference of the (p angle for Phe332 angle is small, 
about -152° for the wild-type and -168° for the mutant. In 
contrast, the ip angle for Phe332 is drastically altered from 
about -138° for the wild type to -77° for the mutant P333E. 
This results in a different main chain conformation for the 
loop 1 for wild type and mutant from residue Phe332 to 
residue Ser335. Starting with residue VaI337 the main chain 
and side chain coordinates can again be superimposed within 
the error limits of the NMR data. The differences in main 
chain torsion angle also lead to different packing for the 
■residues in contact with residue 333, Pro333 of the wild- 
type UBA(2) shows contacts to other residues participating 
in the hydrophobic core, namely, residues Phe332 and 
Leu336. The x\ torsion angle of residue Phe332 is changed 
from —60° to — 52°» and the xi torsion angle is altered from 
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Figure 6: (A) Comparison of the structure of loop 1 (residues Leu327 to VaI337) of UBA(2) (black) and UBA(2)P33^.E (blue) (B) Same 
as in panel A but rotated around the r-axis by approximately 90^ The site of mutation is illustrated bv different colors for the respect "ve 
side chams. red for the w.id-type Pro..... and green tor the mutant Glu333. The superposition of the structures reveals that the differences 
between the loop conformations of the wild-type and mutant proteins are larger than the rms deviation within their structure ensembl^^^^^ 
Residues farther away from the mutation, e.g Leu327 and Val337, have identical side chain conformations in the wild-rvpe and mutant 
protein domam struc ures. The miyor structural change occurs at the backbone V' torsion angle for residue Phe332. which leads to a different 
side chain conformation of the Phe residue and a different orientation of the mutated residue 333. tnrrerenr 



107° to 152° for the wild-type vs P333E mutant proteins, 
respectively. Differences of a similar order of magnitude are 
observed for residue Leu336 as well. The torsion angles X\ 
and %2 are changed by about 16° (from 68° for wild-type t^ 
84° for P333E) and 54° (from -172° for the wild-type to 
- 1 18° for P333E), respectively. Residue Leu33(), which has 
no direct contact to residue 333, exhibits only small changes 
for the side chain orientation as an effect of the packing 
against residue Phe332. The side chain torsion angles for 
residues Leu327 and Val337, both of which are packed in 
the hydrophobic core, are identical within the error limits of 
the NMR data. 

Comparison of Structural Data with the Results ofS. cer, 
Rad23A Mutagenesis. A NOESY spectrum of the 5. cerevi- 
siae Rad23A UBA(2) domain showed a very similar NOE 
pattern to the wild-type UBA(2) domain (data not shown). 
This indicates that, like the P333E mutant, it exhibits the 
same overall fold and secondary structure as the HHR23A 



UBA(2). The S. cerevisiae Rad23A UBA(2) domain does 
not bind to Vpr, as discussed. In comparison to the UBA(2) 
domain of HHR23A. the UBA(2) domain of 5. cerevisiae 
differs in the amino acid sequence in the vicinity of the GFP- 
loop, specifically Ala329 is substituted by a negatively 
charged Glu, Pro333 is exchanged by a Glu, the subsequent 
amino acid Glu334 is charge- reversed with an Arg residue, 
and the Ser335 residue is replaced by a negatively charged 
Asp. Analysis of a UBA(2) chimera derived from HHR23A 
and S. cerevisiae demonstrated that the Pro residue within 
the GFP-loop is insufficient for binding to Vpr (Figure 3, 
panel C). Replacement of the Glu residue found at position 
370 in the UBA domain of 5. cerevisiae [equivalent to 
position 333 in HHR23A UBA(2)] with a Pro does not 
restore binding to HTV-1 Vpr, indicating that there are 
additional determinants for the binding of Vpr. Using the 
sequence alignment of Figure 3, panel A, we can predict 
the equivalent hydrophobic epitope 1 for the UBA domain 
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of 5. cerevisiae. This analysis shows thai in addition to the 
change to Glu at position 333, two other residues that are 
part of the hydrophobic patch are different between the two 
UBA domains, i.e., Ala329 and Leii356, which are replaced 
by Gill and Phe. respectively. This change of two small 
hydrophobic residues for a charged and much larger amino 
acid, respectively, is likely to have a significant'^effect on 
the structure of the proposed interaction surface between 
UBA(2) and Vpr. 

SUMMARY AND CONCLUSIONS 

The UBA domains are found in a number of proteins 
besides HHR23A with diverse functions involved in ubiq- 
uitination. DNA repair, and signaling pathways via protein 
kinases (76). One of the unusual features of the UBA(2) 
domain of HHR23A is the presence of large hydrophobic 
patches at the protein surface. The existence of 'these larse 
hydrophobic patches strongly suggests that these may serve 
as binding sites for other proteins. For HIV-1 Vpr. we 
confirmed by site-directed mutagenesis that one of the 
hydrophobic patches encompassing Pro333 is critical for the 
interaction between Vpr and UBA(2) {18). To understand 
the structural basis of the loss in binding upon mutation of 
Pro 333 to Glu, we detennined the structures of the wild- 
type UBA(2) domain and the mutant P333E to high resolu- 
tion. A comparison shows that while the overall fold of the 
UBA domain is not affected by the substitution, there are 
significant changes that occur locally around the site pf 
mutation. The high accuracy of the structures revealed that 
substitution of a Pro residue 333 with a Glu resulted in 
changes of the shape and charge of the hydrophobic patch 
predicted to be the Vpr-interacting domain. These changes 
account for the loss of the ability to bind Vpr. 

The structural and genetic information developed in the 
study should facilitate the idendfication and characterization 
of cellular proteins that bind through UBA. One hypothesis 
that can be addressed with the structural knowledge obtained 
here is that Vpr may compete or alter the interaction between 
UBA(2) and a normal cellular protein involved in DNA 
repair signaling processes. To date, despite their ubiquitous 
occuiTence, only two cellular binding partners for UBA 
domains have been identified. Ubiquithi has been shown to 
bind to the UBA domain of p62. a ligand of the SH2 domain 
of p56'*^^ (37). However, the very low sequence homolosry 
between the UBA domain of p62 and UBA(2) of HHR23A 
does not suggest a similar interaction between UBA(2) and 
ubiquitin. Significantly for this work, HHR23A and B have 
been shown to interact with the 3-methyladenine DNA 
glycosylase (MPG) protein and have been proposed to serve 
as universal DNA damage recognition accessory proteins. 
Furthermore, MPG binds specifically to the UBA(2) of 
HHR23A. It is therefore possible that HIV- 1 Vpr may 
compete with MPG for binding to HHR23A, thus affectinc 
DNA nucleotide excision repair. This provides support for 
our hypothesis that the interaction of HIV-1 Vpr with this 
domain may be important to its functional role in mediating 
cell cycle arrest. * 
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ed annealing was done with additional restraints that define the U305 
and U300 base triples (Table 2). During all calculations the P4-P6 
domain and the docked PI helix were held rigid. In the resulting 
model, there are no van der Waals or chemical geometry violations. 

Note added in proof: While this work was in press, Golden et al?^ 
published a model based upon 5 A crystallographic data of a 
Tetrahymena intron lacking the Pi helix and several other periph- 
eral elements known to stabilize the catalytic core. Within theJ8/7 
region there are substantial discrepancies between the 5 A model 
and the results presented here as well as other biochemical data^ '^. 
In order for J8/7 to form the appropriate base triples with PI and 
P3, it must rotate 180" about the phosphate backbone atASOl and 
A302 and P3 must translate by more than 20 A near U300. This 
suggests that the J8/7 region of the crystallized intron lacking die Pi 
helix may not be preordered for substrate binding. 
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Stmctyre of a hymaDu DWA 
irepaoir proteSin UBA domanin 
that DOTiteracts WDth Vpr 

The HIV-1 protein Vpr is critical for a number of viral func- 
tions including a unique ability to arrest T-cells at a G2/M 
checkpoint and induce subsequent apoptosis. It has been 
shown to interact specifically with the second UBA (ubiqui- 
tin associated) domain found in the DNA repair protein 
HHR23A. a highly evolutionarily conserved protein. This 
domain is a commonly occurring sequence motif in some 
members of the ubiquitination pathway, U V excision repair 
proteins, and certain protein kinases. The three dimension- 
al structure of the UBA domain, determined by NMR spec- 
troscopy, is presented. The protein domain forms a 
compact three-helix bundle. One side of the protein has a 
hydrophobic surface that is the most likely Vpr target site. 

Vpr mediates a number of different functions during the 
HIV life cycle', including nuclear import^ and the induction 
of ceil cycle arrest in the G2 phase^-^ While the exact mecha- 
nism of Vpr in induction of cell cycle arrest has not yet been 
elucidated, it has been shown that Vpr is both necessary and 
sufficient to cause accumulation of cells in G2 phase^. The 
arrested HIV-1 infected T-ceils subsequently undergo cell 
death by apoptosis. It has been hypothesized that this can 
have a significant effect in HIV pathogenesis by preventing 
efficient activation and subsequent clonal expansion of CD4* 



T-cells^-^ The Vpr mediated cell cycle arrest is in some aspects 
similar to the arrest of cells at certain cell cycle checkpoints as 
a result of exposure to DNA damaging agents^ Using a yeast 
two-hybrid assay, it has recently been shown that Vpr also 
interacts with the DNA repair protein HHR23A"'. the human 
homolog of Rad23. HHR23A is an acidic 40,000 H protein 
that shares extensive homology with the protein encoded by 
the Saccharomyces cerevisae RAD23 gene^K The Rad23 protein 
is involved in nucleotide excision repair and is up-regulated if 
the ceil is exposed to DNA-damaging agents'*. All of the 
Rad23 homologs identified to date share a common domain 
structure with a so-called ubiquitin-like region at the N-ter- 
minus and two highly conserved regions of -50 amino acids, 
termed the ubiquitin associated (UBA) domains'^, in the cen- 
ter and at the C-terminus (Fig. la). The 45 amino acid C-ter- 
minal UBA domain (UBA (2)) was identified as the 
Vpr-interacting domain of HHR23A by an in vitro binding 
assay'°. UBA domains are a very common sequence motif in 
proteins of the ubiquitination pathway, UV excision repair 
proteins and certain protein kinases'^. Although the specific 
role of this protein domain is so far not known, it has been 
suggested that UBA domains are involved in conferring target 
specificity to multiple enzymes of the ubiquitination system. 
In order to further our understanding of the interaction of the 
UBA domain with Vpr and other proteins, we have deter- 
mined the structure of UBA(2) of HHR23A using homonu- 
ciear NMR spectroscopy. This is the first three-dimensional 
structure of a UBA domain. 

Withers-Ward et al'° have shown that overexpression of 
both the full-length HHR23A protein and a 179 amino acid 
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UBA(1) UBA linker region UBA(2) 



81 
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Fig. 1 a, Sequence and domain struc- 
ture of HHR23A. The different 
domains that have been identified by 
sequence analysis are indicated as 
shaded boxes, b, Sequences of UBA 
domains used in this study. The amino 
acid positions that are conserved to 
more than 90% in all Icnown UBA 
domains are highlighted. 
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C-terminal derivative. HHR23A(B213). which contains part 
of the UBA(l) domain, the UBA linker region and the entire 
UBA(2) domain results in significant alleviation of HIV-1 
Vpr-induced cell cycle arrest. To further investigate the 
importance of the C-terminal UBA domain in cell cycle 
arrest, constructed a plasmid pXCB213trunc. that 
expressed HHR23A(B213trunc). a C-terminal truncation 
containing a deletion of the entire UBA (2), domain to test in 
the alleviation assay, in v^hich co-expression of Vpr and 
HHR23A(B213) results in partial alleviation of Vpr-induced 
G2 arrest (Fig. 2a, panels II and IV). In contrast, we found 
that cells co-expressing Vpr and HHR23A(B213trunc) were 
unable to alleviate HIV-1 Vpr-induced 02 arrest (Fig. 2a, 
panels IV and V). Thus, removal of the C-terminal 45 amino 
acids that comprise the UBA(2) domain of HHR23A(B213) 
was sufficient to abolish the alleviation of G2 arrest seen with 
the HHR23A(B213) although a stable truncated protein was 

expressed (Fig. 2/)). These results confirm that 

the UBA (2) domain plays a central role in the 
alleviation of Vpr induced 02 arrest and that 
binding of Vpr to HHR23A is a crucial step in 
this process. 

To obtain further information on the UBA(2) 
domain of HHR23A. the structure of a 45 amino 
acid peptide corresponding to residues 319-363 
ofHHR23A (Fig. 1) was determined by homonu- 
clear NMR spectroscopy. The peptide was pre- 
pared by chemical synthesis and dissolved easily 
in the NMR buffer (50 mM potassium phosphate 
buffer at pH 6.5). The one-dimensional NMR 
spectrum of UBA (2) has a dispersed amide 
region indicative of a folded structure. The 
assignments and secondary structure were deter- 
mined from analysis of two-dimensional DQF- 
C0SY»3. TOCSY'^'5 and NOESY spectra^^ 
following established procedures'^ . The five N- 
terminal and C-terminal residues show no signs 
of any stable secondary structure; that is, there 
are no significant NOEs observable and the 
amide protons exchange rapidly with the solvent. 
This effect is also observed in the longer synthet- 
ic peptide with 10 amino acids added to the N- 
terminus (Fig. 1/?, HHR23A-I-10N corresponding 



to residues 309-363 of HHR23A, data not shown) confirming 
that the residues 324-358 form the independently folding 
unit of the UBA motif. Qualitative analysis of the NOESY 
spectra (Fig. 3a) showed that UBA(2) forms a highly a-helical 
structure (Fig. 3/),c) as predicted by secondary structure pre- 
diction programs'8-20. The three-dimensional structure of the 
molecule was determined based on NOE intensities extracted 
from 2D-N0ESY spectra acquired with mixing times of 80 
and 250 ms, using a combined distance geometry and simu- 
lated annealing approach2»-22, A total of 50 structures were cal- 
culated, and the 10 structures with the lowest DYANA" target 
function value were selected for the final analysis. The struc- 
tures are very well defined within the core region of the mole- 
cule (residues 5-42) (Fig. 4a). Input data and statistics of the 
resulting structures are summarized in Table 1. 

The UBA domain folds into a compact three helix bundle 
with the helices oriented at angles of -45** relative to the lon- 



Table 1 Summary of input data and structure refinement parameters 

Number of experimental restraints: 
Distance restraints from NOEs: 
Intra residue 
Sequential 

Medium range (i + 2 to i + 4) 
Long range (> i + 4) 
Number of NOEs per residue (residues 5-42) 
Number of H-bond restraints (2 per bond) 
Total number of experimental restraints; 
R.m.s. deviations from experimental data: 
Number of refined structures 
Average of DYANA target function 
Number of violations > 0.5 A 
Number of violations > 0.3 A 
Average distance restraint violation 
Ramachandran Analysis: 

Residues in favored regions 
Residues in additional allowed regions 
Residues in generously allowed regions 
Residues in disallowed regions 
Coordinate precision (A): 

R.m.s. deviation of backbone atoms (residues 5-42) 
R.m.s. deviation of all heavy atoms (residues 5-42) 



82 
123 
86 
55 
9.4 
34 
380 

10 

1.24 ±0.07 
none 

6 (maximum 0.39 A) 
0.017 ±0.001 A 

68.4% 
26.8% 
4.8% 
0.0% 

1.24 ±0.36 A 
2.22 ± 0.51 A 
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Fig. 2 Abrogation of alleviation of Vpr-induced cell cycle 
arrest by overexpression of a 134-residue C-termina! portion 
of HHR23A, HHR23A(B213trunc). that contains a deletion of 
the C-terminal UBA domain, a, Hela cells were co-transfect- 
ed with either BSVprThy or pXCThy and a 20-fold molar 
excess of either pXC (3.2 ^ig), pXCB213 (4.0 \ig), or 
pXCB213trunc (4.0 \Lg), and analyzed by flow cytometry as 
described^**. Duplicate cotransfections are shown except for 
mock-transfected cells. Panel I, mock transfected cells; panel 
II, BSVprThy and pXC; panel III. pXCThy and pXC; panel IV, 
BSVprThy and pXCB213; panel V, BSVprThy and 
pXCB213trunc. b. Expression of HHR23A(B213trunc) in Hela 
celts. Hela cells were co-transfected with BSVprThy and 
pXCB213 or pXCB213trunc, and analyzed by Western blot- 
ting as described. Lanes 1 and 2 contain lysates from dupli- 
cate samples of cells co-transfected with BSVprThy and 
pXCB213 (shown in (a) panel IV), lanes 3 and 4 contain 
lysates from duplicate samples of cells co-transfected with 
BSVprThy and pXCB213trun c (shown in (a) panel V). 



HHR23A 
B213 B213trunc 
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gitudinal axis of the molecule (Fig. 46). The residues 324-330, 
334-343 and 350-358 form three a-helices, ai. ai and 
respectively (numbering scheme based on full size HHR23A). 
In addition to the structure of UBA (2) from HHR23A dis- 
cussed here, the NOESY spectra of synthetic peptides corre- 
sponding to the sequences of the C-terminal UBA domains 
from HHR23B and yeast Rad23 (Fig. \b) were analyzed. The 
assignments and the NOE distribution confirm that all these 
protein domains have the same secondary structure and over- 
all fold. The pattern of conserved residues (Fig. lb) in the dif- 
ferent UBA domains can largely be explained by their 
participation in helix-helix interactions. The residues Gly 331, 
Phe 332 and Pro 333 form a sharp turn (which we call the 
GFP-loop) with Gly 331 and Phe 332 being highly conserved 
(Gly 331 to 100%. residue 332 is F or Y in 90% of the UBA 
sequences). The side chain of Phe 332 is sandwiched between 
the methyl groups of Leu 355 and Leu 356 connecting the 
GFP-loop between helices tti and az to the C-terminal end of 
helix tts (Fig. 4c). The surface of the UBA domain has an 
uneven distribution of charged and hydrophobic amino acids 
(Fig. 4d.e). This large hydrophobic area on the bottom of the 
helix bundle in the vicinity of the GFP-loop is the most likely 
interface for protein-protein interactions. 



DNA damage and Vpr action both lead to cell cycle arrest at 
the same checkpoint which suggests that the same pathways 
may be used. The binding of Vpr to UBA (2) of HHR23A seems 
to be a crucial step in this process (this paper and ref. 10). The 
N-terminal ubiquitin-like domain of yeast Rad23 has recently 
been shown to interact with the proteosome^^ A similar role 
for the HHR23A and B proteins is likely based on the extensive 
homology between them and Rad23 with respect to sequence 
and function. The linker region of HHR23B has been shown to 
be involved in XPC (Xeroderma pigmentosum group C) bind- 
ing". Thus the Rad23 homologs provide a link between the 
ubiquitin/proteosome and DNA excision repair pathways and 
are potentially involved in the regulation of the assembly of the 
repair complex. In this context it is interesting that native 
Rad23 is ubiquitinated and degraded during the transition 
between the Gl and S phases of the cell cycle^^ It is possible 
that binding of Vpr disrupts this link between cell cycle con- 
trol and DNA repair through the interaction with UBA (2). 
The structure of the UBA(2) domain of HHR23A shown here 
provides a rational basis for designing mutant proteins to test 
and further define the Vpr binding site on HHR23A. Because 
of the multiple functions of Vpr important for HIV-1 replica- 
tion as well as pathogenesis, the development of therapeutics 
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Fig. 3 a, 300 ms 2D WATERGATE-NOESY spectrum of the HHR23A DBA domain 2. The spectrum was acquired at 293 K on a sample containing 1 .0 mM 
synthetic peptide in 450 \i\ 50 mlVI potassium phosphate buffer at pH 6.5. A total of 600 1^ increments of 64 scans each were acquired in States-TPPI 
mode. The spectrum was processed with 2.048*2,048 real points after multiplication with a squared sine bell function shifted by ic/3 and tU2 In F, and 
F2 respectively, b, Summary of NOE contacts for HHR23A UBA(2). The numbering scheme is that of the full size HHR23A. e, NMR derived secondary 
structure of the DBA domains of HHR23A. 



directed against Vpr as a target would provide another line of 
attack against HIV-1 disease. The understanding achieved here 
regarding a critical domain that interacts with Vpr is a first 
step towards meeting this goal. 

Methods 

Cells. HeLa cells were grown in Dulbecco's modified Eagle's medi- 
um (0-MEM) supplemented with calf serum (10%), penicillin (100 
U ml"^) and streptomycin (100 \ig ml-^). At a time of 18-24 h prior 
to transfection cells were plated in six-well plates at a density of 
2x10^ per well. HeLa cells were cotransfected with either 
BSVprThy or pXCThy and -20-fold molar excess of either pXC, 
pXCB213, or pXCB213trunc using Lipofectin (Gibco BRL) as direct- 
ed by the manufacturer. At 48 h after transfection celts were ana- 
lyzed by flow cytometry as described below. 

Mammalian eitpression constructs. The plasmids BSVprThy. 
pXCThy and pXC have been descrlbedi°. pXCB213trunc contains a 
truncation of the C-terminal UBA domain of the HHR23A(B213) frag- 
ment designated UBA(2), and was constructed by insertion of PGR 
amplified HHR23A sequences (nucleotides 589-930) into the XhcA and 
MIlA sites of the expression plasmid pXC. The sequences of the primers 
(provided by DMA Technology Group, Amgen, Boulder, CO) used for 



PGR amplification were: 5'-CCGGTGGAGATGGACTACAAAGACGAT- 
GACGATAAAAGAGCCAGCTAGAAGAAG-3' (forward primer) and 
5'-GGGAGGGGnCA- TGACGGGGTGACCTGGATGTAGT-3' (reverse 
primer). The forward primer contains an Xho\ restriction site, a 
translation initiation site and an IVI2 FLAG epitope tag (boid). The 
reverse primer contains an Mlu\ site at its 5' end. The DMA 
sequence of the Xho\-Mlu\ fragment in pXCB21 3trunc was con- 
firmed by sequence analysis. The HHR23A(B213) expression plas- 
mid pXGB213 encodes the G-terminal portion of HHR23A 
(residues 185-363). The HHR23A(B213trunc) expression plasmid 
pXCB213trunc encodes a truncated G-terminal portion of HHR23A 
(residues 185-318) and contains a deletion of the C-terminal 
UBA(2) domain. The BSVprThy expression plasmid encodes the 
HIV-1 Vpr protein (96 residues) and the murine Thy1.2 protein 
(162 residues), a cell surface marker that allows us to distinguish 
successfully transfected cells. The pXCThy expression plasmid 
encodes the murine Thyl.2 protein as described above. The pXC 
expression plasmid does not encode any proteins and is used as a 
control. 

Flow cytometry. At 48 h after transfection, cells were stained 
with a monoclonal antibody to the Murine Thy1.2 cell-surface 
protein directly conjugated with fluorescein isothiocyanate (FITC) 
and propidium iodide (PI) and analyzed by flow cytometry using 
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Fig. 4 a. Stereo view of the final set of 10 struc- 
tures. The structures are oriented so that the axis 
of a- helix 3 is pointing towards the viewer. The 
almost perpendicular arrangement of helices 2 
and 3 is clearly visible. Some representative side 
chains are shown in green to illustrate the defini- 
tion of the core of the molecule, b. Ribbon repre- 
sentation of the structure looking at the side of 
the helix bundle, c Detailed view of the GFP- 
loop-helix 3 interface showing the backbone of 
all residues in beige, the backbone of residues 
331-333 and 355,356 in blue and the highly con- 
served side chains in green, d, Top and e, bottom 
view of the UBA domain with contact surface. 
Shading according to charge density. Surface and 
charge distributions have been calculated using 
MolMol with standard charge settings. 







the Lysis II software (Becton Dickson) as described^°. Gates were set 
to exclude clumps and cell debris and on the Thy1.2- and Thy1.2* 
populations. The Thyl,2- and Thy1.2* populations were distin- 
guished by comparing the staining pattern of Thy1.2 transfected 
populations to that of a mock-transfected population of cells. A 
total of 800-2,000 Thyl .2* events were acquired for each sample 
except in the case of mock infected cells, where 10,000 Thyl .2" 
events were acquired. The (G1/G2+M) ratios presented are those 
calculated from the Thyl. 2* cells except in the case of mock trans- 
fected ceils, where the ratio represents that of the Thyl. 2- popula- 
tion. The Thy1.2-FITC fluorescence intensity is plotted on the X axis 
and the fluorescence intensity of PI is plotted on the Y axis. 

Western blot analysis. Cell lysates from transfected Mela cells 
were prepared as described Proteins were separated by SDS PAGE 
and electro- blotted to nitrocellulose membranes. Blots were incu- 
bated with a mouse monoclonal antibody specific for the M2 FLAG 
(Eastman Kodak Co.) followed by incubation with a horseradish per- 
oxidase-linked sheep anti-mouse immunoglobulin (Amersham Life 



Science Inc.). Antibody binding was detected using the ECL detec- 
tion kit (Amersham Life Science (nc). 

Synthesis of peptides and sample preparation. All peptides 
were synthesized by the Fmoc (fluorenylmethoxycarbonyl)/t-butyl 
based solid phase peptide chemistry method on an ABI 431 A (Perkin 
Elmer) peptide synthesizer utilizing a single coupling program to 
carry out the chain assembly. The crude peptide was dissolved in 8 M 
guanidine-HCl, and purified to >95% homogeneity by preparative 
reverse-phase HPLC on a Vydac Cia (2.5 cm x 25 cm) column with linear 
gradients consisting of 0.1%TFA (v/v) in water and 0.05%TFA (v/v) in 
acetonitrile. The composition and purity of each peptide was assessed 
using electro-spray ionization (ESI) mass spectrometry and amino acid 
analysis. NMR samples were prepared by dissolving the purified, 
lyophilized peptide in the NiVlR buffer (50 mM potassium phosphate 
buffer at pH 6.5 in 99.9996% D2O or 90% H20/10% D2O). Undissolved 
peptide was spun down in a microcentrifuge prior to transferring the 
samples into standard 5 mm NMR tubes. The resulting samples were 
between 0,8 and 1.0 mM in peptide in 450 ^1 solution. 
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h!MR spectroscopy. All spectra were acquired on Bruker Avance 
DRX spectrometers at 500 and 600 MHz. Water suppression was 
achieved using presaturation or WATERGATE^^ sequences. Two- 
dimensional DQF-C0SY13, TOCSY^<i5 and NOESYi^ spectra were mea- 
sured at 293 K in 100% DjO and 90% H20/10% D2O at 293 K. The 
TOCSY datasets were acquired using the CITY^' or MLEV-17" mixing 
sequences with mixing times of 50 and 80 ms, for NOESY spectra mix- 
ing times of 80, 150, 250 and 300 ms were applied. The spectra were 
acquired with data matrices of 300*1024 complex points in Fi and F2, 
respectively. For the DQF-COSY 800 points were acquired in F,. In 
addition a ^H-^^C correlation at natural abundance was acquired 
using a sensitivity enhanced, gradient selected HSQC pulse 
sequence". For this data set 160 scans per ti increment were acquired 
with 40*1024 complex points in F, and F^ respectively. All spectra 
were processed using squared sine bell window functions that were 
shifted by kJZ and jd2 in F, and Fj respectively. The final data matrices 
for all data sets were 1024*1024 real points. 

Data analysis and structure calculations. The data analysis and the 
assignment of proton resonances were facilitated by the use of the 
software package XEasy. The structure calculations were done with 
the program DYANA". A total of 50 structures were generated using 
the standard DYANA protocol; the 10 structures with the lowest value 
for the target function were used for the final analysis using the pro- 
grams MolMol^** and Insightll (Biosym Inc.). Protein secondary struc- 
ture prediction was performed using the EMBL PHD mail server^^ 

Coordinates. The coordinates of the ten final structures of the 
HHR23A C-terminal DBA domain have been deposited in the 
Protein Data Bank (accession number 1UBA). 
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Stiryctyres of Cdc42 
boyodl to the actove aimd 
cataDytHcaDlly commproimiisedl 
forms of Cd]c42GAP 

The Rho-related small GTP-binding protein Cdc42 has a low 
intrinsic GTPase activity that is significantly enhanced by its spe- 
cific GTPase-activating protein, Cdc42GAP. In this report, we 
present the tertiary structure for the aluminum fluoride-pro- 
moted complex between Cdc42 and a catalytically active domaki 
of Cdc42GAP as well as the complex between Cdc42 and the cata- 
lytically compromised Cdc42GAP(R305A) mutant. These struc- 
tures, which mimic the transition state for the GTP hydrolytic 
reaction, show the presence of an AIF3 molecule, as was seen for 
the corresponding Ras-pl20RasGAP complex, but in contrast to 
what has been reported for the Rho-Cdc42GAP complex or for 
heterotrimeric G protein a subunits, where AIF4- was observed. 
The Cdc42GAP stabilizes both the switch I and switch II 



domains of Cdc42 and contributes a highly conserved arginme 
(Arg 305) to the active site. Comparison of the structures for the 
wild type and mutant Cdc42GAP complexes provides important 
insights into the GAP-catalyzed GTP hydrolytic reaction. 

GTPase-activating proteins (GAPs) are critical regulators of ail 
signaling pathways mediated by members of the Ras superfamily 
GAPs serve to attenuate these signals by accelerating the hydroly- 
sis of GTP to GDR Recent biochemical and structural studies 
demonstrating the ability of aluminum fluoride to act as a transi- 
tion state analog in the hydrolytic pathway of low molecular 
weight GTP-binding proteins has provided a new tool to investi- 
gate the mechanism of GTP hydrolysis and the role of GAPs in 
accelerating this reactioni. Previous studies from our laboratory 
have demonstrated the ability of aluminum fluoride to induce 
stable complex formation between the GDP-bound form of 
Cdc42 and its Cdc42GAP2. Here, we report the crystal structures 
of the aluminum fluoride-promoted transition state complex 
between Cdc42 and the wild type Cdc42GAP (sometimes 
referred to as RhoGAP^), as well as the complex between Cdc42 
and the catalytically compromised Cdc42GAP(R305A) mutant 
and discuss the mechanistic implications of these structures for 
GAP'catalyzed GTP hydrolysis. 
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Figure 1: Indd 
Signaling protein 

Structure of neddS 



Accession number: PF00240 



Ubiquitin family 



c 



This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue) 
(see ULP1 YEAST ), NeddS (see NEDS IVIOUSE ). Elongin B (see Q 15370 ). Rubl 
(see Q9SHE7 ). 

lEV! This family forms structural complexes with other Pfam families, to view 
them click here 



INTERPRO description (entry iPR000626 ) 

Ubiquitin r MEDLINE:91274342 L PUBOOG00768, PUB00000768 is a protein of 
seventy six amino acid residues, found in alleukaryotic cells and whose sequence 
is extremely well conserved from protozoan to vertebrates. It is widely known as 
a post-translational tag used to signal a protein's hydrolytic destruction. Other 
functions for ubiquitin, depend on its differential internal isopeptide linkages. In 
addition, several ubiquitin-like proteins have been discovered from genome- 
sequencing efforts, other structural studies, and genetic screens. These new data 
show that proteins with the ubiquitin domain are adaptable, transposable genetic 
elements, which have been appended to other genes and utilized for many 
different cellular functions, depending on the ubiquitin-like protein's identity, 
subcellular location, and method of covalent attachment. The post-translational 
ligation of proteins to members of the ubiquitin superfamily can signal many 
different fates for the target protein [Larsen and Wang (2002) J. Proteome Res. 
1, 411-419.] 

Ubiquitin is a globular protein, the last four C-terminal residues (Leu-Arg-Gly-Gly) 
extending from the compact structure to form a 'tail* important for its function. 
The latter is mediated by the covalent conjugation of ubiquitin to target proteins, 
by an isopeptide linkage between the C-terminal glycine and the epsilon amino 
group of lysine residues in the target proteins. 

In most species, there are many genes coding for ubiquitin. However they can be 
classified into two classes. The first class produces polyubiquitin molecules 
consisting of exact head to tail repeats of ubiquitin. The number of repeats is 
variable. In the majority of polyubiquitin precursors, there is a final amino-acid 
after the last repeat. The second class of genes produces precursor proteins 
consisting of a single copy of ubiquitin fused to a C-terminal extension protein 
(CEP). There are two types of CEP proteins and both seem to be ribosomal 
proteins. There are a number of proteins which are evolutionary related to 
ubiquitin, including ubiquitin-like proteins from baculoviruses ; mammalian 
proteins GDX, FAU and RAD23-related proteins; human spliceosome associated 
protein 114, proteins BAT3 and CKAPl/fFCB, and ubiquitin-like proteins SMT3A, 
SMT3C and SMT3B; yeast proteins RAD23, DK2 and SMT3; and C. elegans SMT3 
and ubl-1 proteins. 



For additional annotation, see the PROSITE document pdoc00271 [Expasy l SRS-UK I SRS-USA ] 
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Alignment: 0964rad23,sid2 - pfam|hmm|lJBA 

HMMPFAM - alignment of 0964rad23.sid2 against pfam|hmni|UBA 

UBA/TS-N domain 

• This hit is scoring at : 52.5 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 159 NLEQTIQQILDMGGGTWERDTVVRALRAAYNN—PERAIDYLYSG 201 

E:.::Q: :MGGG.:.R:.. :ALRA. NN ERA:::L.. 
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HMMPFAM - alignment of 0964rad23.sid2 against pfam|hmm|UBA 

UBA/TS-N domain 

• This hit is scoring at : 33.6 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 
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Legend of Alignment 

: positive score 

score between -2 and 0 
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Accession number: PF00627 



UBA/TS-N domain 



C 



This small domain is composed of three alpha helices. This family includes the 
previously defined UBA and TS-N domains. The UBA-domain (ubiquitin 
associated domain) is a novel sequence motif found in several proteins having 
connections to ubiquitin and the ubiquitination pathway. The structure of the 
UBA domain consists of a compact three helix bundle [1]. This domain is found 
at the N terminus of EF-TS hence the name TS-N. The structure of EF-TS is 
known and this domain Is implicated in its interaction with EF-TU [2]. The 
domain has been found in non EF-TS proteins such as alpha-NAC P70670 and 
MJ0280 Y280 MET J A [1]. 

This family forms structural complexes with other Pfam families, to view 
them click here 



INTERPRO description (entry iprooo449 ) 

UBA domains are a commonly occurring sequence motif of approximately 45 
amino acid residues that are found in diverse proteins involved in the 
ubiquitin/proteasome pathway, DNA excision-repair, and cell signaling via 
protein kinases [ MEDUNE;97025177 1. The human homologue of yeast Rad23A 
is one example of a nucleotide excision-repair protein that contains both an 
internal and a C-terminal UBA domain. 

The solution structure of human Rad23A UBA(2) showed that the domain forms 
a compact three-helix bundle [ MEDLINE:99061330 1. Comparison of the 
structures of UBA(l) and UBA(2) reveals that both form very similar folds and 
have a conserved large hydrophobic surface patch which may be a common 
protein-interacting surface present in diverse UBA domains. Evidence that 
ubiquitin binds to UBA domains leads to the prediction that the hydrophobic 
surface patch of UBA domains interacts with the hydrophobic surface on the 
five-stranded P -sheet of ubiquitin r MEDLINE:2207534n . 



For additional annotation, see the PROSITE document pdoc50015 [ Expasv I SRS-UK I SRS-USA ] 



Alignment 


Domain organisation 


« Seed (162) O Full (473) 
Format | Coloured alignment ^ 


Seed (162) G Full (473) O Context (51) 

As a Graphic As a Tree 

Zoom |0:5 J pixels/aa. D Bootstrap tree 


Further alignment options here 
Help relating to Pfam alignments here 


To find out about the NIFAS tree-viewer, click here 







http://www.sanger.ac.uk/cgi-bin/Pfam/getacc 7PF00627 
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Species Distribution 


Phylogenetic trcse 


View alignments & domain organisation by species 


<?' Seed (162) C Full (473) 


Tree depth :)Show all levels |s| 




The trees were generated using Quicktree 
To find out more about ATV phyloaenetic tree-viewer click here 



Database References 


PDB 

You can find out how to set up Rasmol here 


|laipC;2;42; ^^^Jj^j^^^ ^^^j/^jljjjjj^ 
^^b^tnsjj^j^l^ ^^^^^^^^^ 


URL 


http://www.isrec.isb-sib.ch/profile/isrec96/poster.html 


PROSITE 


PDOC50015 [ Expasv I SRS-UK I SRS-USA] 


PROSITE_PROFILE 


PS50030 


COGS 


COG0264 COG 1308 


HOMSTRAD 


EF TS 


PFAMB 


PB005809 PB015819 PB016122 PB038705 PB073598 PB073788 PB074C 


SYSTERS 


UBA 


PANDU 


UBA 



Literature References 


Pfam specific Information 


The structure of the 
Escherichia coll EF-Tu.EF-Ts 
complex at 2.5 A resolution. 
Kawashima T, Berthet-Colominas 
C, Wulff M, Cusack S, Leberman R; 
Nature 1996;379:511-518, 


Author of entry 


Bateman A 


Type definition 


Domain 


Alignment method of seed 


Manual 


Source of seed members 


Bateman A 



2. Alignment of TS-N domain is in 



figure 7c. 

Comparative genomics of 
the Archaea 


HMMER build information 


(Euryarchaeota): evolution 
of conserved protein 




Pfam Is [Download HMM] 


Pfam fs [Download HMM] 


families, the stable core, and 
the variable shell. 


Gathering cutoff 


17.3 17.3; 


19.0 19.0 


Makarova KS, Aravind L, Gatperin 
MY, Grishin NV, Tatusov RL, Wolf 
YI, Koonin EV; 

Genome Res 1999;9:608-628. 


Trusted cutoff 


17.3 17.3; 


19.0 19.0 


Noise cutoff 


17.2 17.2; 


18.9 18.9 


3, 

Structure of a human DNA 


Build method of HMI^ 


hmmbuild -F HMMJs SEED 
hmmcalibrate -seed 0 HMMJs 


hmmbuild -f -F HMMJs SEED 
hmmcalibrate -seed 0 HMM_fs 



repair protein UBA domain 
that interacts with HIV-1 Vpr. 



http://www.sanger.ac.uk/cgi-bin/Pfain/getacc7PF00627 
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Dieckmann Withers-Ward ES, 
Jarosinski MA, Liu CF, Chen IS, 
Feigon J; 

Nat Struct Biol 1998;5:1042-1047. 



For help on making stable links to this page click here 



Comments or questions on the site? Send a mail to pfam(a>sanger.ac-uk 



http://www.sanger.ac.uk/cgi-bin/Pfani/getacc7PF00627 
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bioSCOUT 1.5.3 - Alignment: 0964rad23.sid2 - pfam|hmm|UPF0184 Page 1 of 1 

Projects 1 1 Analyses | Admin | j Alert 1 1 siPS | ' Tutorial: 

|:. ';-ieatur|il|l 

Alignment: 0964rad23.sid2 - pfam|hmm|UPF0184 

HMMPFAM - alignment of 0964rad23.sid2 against pfam|hmm|UPF0184 

Uncharacterised protein family (UPF0184) - 

• This hit is scoring at : -24.6 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 24 4 -AGPNANPlnlfpqGVPSGGSNPGVVPGAGSGALDAL-RQLPQFQALLQLVQANPQILQP 
:GPN.:, .... . G . . :A: . . L . Q. . : . L . . : : . . . L.. 

H : 1 MSGpNgdd psSvedggntnedef glqEyeaiNsiuLDqinsaLDalEernDdlins 

MLQELGKQNPQILRLIQENqaeflrLVNESPEGGPG 337 
.L:EL :.N QI ..E. L N.:PE.G.. 

kLrELLeSNRqiRlef aeq IdnnAPesgsd 84 

Legend of Alignment 

: positive score 

score between -2 and 0 

Please report problems and feedback concerning bioSCOUT through the support interface. 




bioSCOUT' 
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Admin \\ Alert | [ SRS 


1 1 Tutorial 




Help 


Analysis Browser: \ 


Level Up 



Report for 
Description 



0964rad23.sid4 (Protein) 



0964 rad23 sid4 



Update 



Edit 



Funetion 



Features 
Summary 



Homologies 



RAD23 protein, isoform II 

Direct assignment of functionality by homology to 

trembl|Y12014 | DCRAD23II_l 

in region 1 to 366 for overall length of 21 (99% of query, 1742% of hit, see the 
alignment ). 

Functional class Replication 

Extracted keywords DNA damage. Nuclear protein. DNA rep air 



query 



pf amsearch 

blocks 

seg 

phd-sec 

phd-acc 

biasdb 

function 

3D-structure 



126 



252 



i l I ■ 



«fl— hM-i- 



-«»}HMIIi 



Protein 



70 clear homologs 



All BLAST hits 

All protein BLAST 
hits 



http://eshpc02.es.dupont.com:8101/cgi-bin/bioSCOUT/SERV/analysis?aid=9241 
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General 



3D Structure 



Expression 



Phylogeny 



Features 



ESTs 
Patents 

Gene name 

Molecular weight 

Sequence length 

Isoelectric point 

Predicted cellular 
localisation (PHD 
and PreLoc) 

Identical sequence 
segments in: 



180 homologs 
177 homologs 

39.87 kD 

368 

4.41 

cytoplasmic (64.8 %) 



embl | AI901927 | AI901927 
genbanki AI90 1 927| AI90 1 927 



All EST BLAST hits 
All patent hits 



3D structure inferred by clear homology from residues 324 to 368 in 1F4I-A 
View aligmnent 
pdb|lF4I | lF4I-A structure 

Expression of this gene is reported for 



Organ Category 
Development Stag e 
Tissue Classification 

Distribution 
Taxa 

Model organisms 



Low complexity 
region 

P-rich region 
S-rich region 
G-rich region 
No significant hits 



other species 

tassel length from 0.1 to 2.5 cm 
normal tissue 

32 species extracted from 189 Species 
homologous sequences. 

Chordata, Eukaryotae, Fungi, 
Planta 

Arabidopsis thaliana, 
Caenorhabditis elegans, 
Drosophila melanogaster, 
Homo sapiens, Mus musculus, 
Saccharomyces cerevisiae 

from 75 to 88, from 98 to 115, from 230 to 241 
detected by [seg] 

from 94 to 129 detected by [biasdb] 
from 75 to 144 detected by [biasdb] 
from 230 to 241 detected by [biasdb] 
[Coils] [Phd-tm] 



http://eshpc02.es.dupont.com:8101/cgi-bin/bioSCOUT/SERV/analysis?aid=9241 
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detected by 



Patterns 



UBA/rS-N domain 
region 



Ubiquitin family 
region 

No significant hits 
found in 



from residue 324 to 363. Source: [ pfamsearch] . 
Quality: (E=2e-09) 

from 144 to 186. Source: [ pfamsearch] . Quality: 
(E=2.7e-ll) 

from residue 1 to 78. Source: [ pfamsearch] . Quality: 
(E=2.3e-23) 

[prosite database] [blocks database] 



Comment 



No comment section. 



Edit 



Completed Taslis ^^^^ ^j^^ ^ser Comment 



03.04.2003, dressvm 
12:46:38 



Output 

bioSCOUT defauh details. 



Interactiv) 



Permissions 



Edit 



Alert Jobs 



New 



Please report problems and feedback concerning bioSCOUT through the su pport interface. 
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Entry Page 



Page 1 of 1 



gjL ;^ 



view 



Complete entries * |R| 



This entry is from: TREMBL:DCRAD23II_1 



TREMBL 



Save 



Link 



HeM Analyses I 
Printer Friendly 



ID 
AC 
DR 
DE 
DE 
OS 
OC 
OC 
OC 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



1 standard; PRT; 379 AA. 



DCRAD23II 
Y12014; 

EMBL; Y12014; DCRAD23II. 
product: "RAD23 protein, isoform II"; 
Daucus carota mRNA for RAD23 protein, isoform II 
Daucus carota (carrot) 

Eukaryota; Viridiplantae; Streptophyta; Ertibryophyta; Tracheopl' 
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; / 
euasterids II; Apiales; Apiaceae; Daucus, 
CDS 40. .1179 

/db_xr e f = " S PTREMBL : OO 3 9 9 1 " 
/product="RAD23 protein, isoform II" 
/f unction="assembly factor of the complex for 
excision repair of V-damaged DNA" 
/protein_id="CAA72742 . 1" 

/translation="MKLTVKTLKGSHFEIRAQPNDTVMAIKKNIE[ 
LIHNGKVLKDESTLAESKISEDGFLVVMLGKSKTMSSTGTPAAQS5 
APAPAAAPASAVIPNTTVPEAPLSPAFAPSDTYGEAASNVVAGSNI 
MWDTNMVSRALRAAYNNPERAVDYLYSGIPEMAEAAVPVSHFQGDC 
AGAAPGAPNSLPLNMFPQETLSGVTGAGLGSLEFLRNNPQFQTLRS 
LELGKQNPQLLRQIQEHHEEFLQLINEPVEASEGDMFDQPEQDVPC 
RLEAMGFDRGLVIEAFLACDRNEELAVNYLLENAGDFED" 
Sequence 379 BP; 
>DCRAD23II_1 

MKLTVKTLKGSHFEIRAQPNDTVMAIKKNIEDLQGKDNYPCGQQLLIHNGKVLKDESTLA 
ESKISEDGFLVVMLGKSKTMSSTGTPAAQSSSAPAPTPAPAVAPAPAPAAAPASAVIPNT 
TVPEAPLSPAFAPSDTYGEAASNVVAGSNLEQTIQHIMDMGGGMWDTNMVSRALRAAYNN 
PERAVDYLYSGIPEMAEAAVPVSHFQGDQINAGNNAISDNGVAGAAPGAPNSLPLNMFPQ 
ETLSGVTGAGLGSLEFLRNNPQFQTLRSMVQRNPQILQPMLLELGKQNPQLLRQIQEHHE 
EFLQLINEPVEASEGDMFDQPEQDVPQEITVTAADQEAIERLEAMGFDRGLVIEAFLACD 
RNEELAVNYLLENAGDFED 



SRS 6.1.3.10 I feedback 



http://eshpc02.es.dupontxoni:8 1 01/srsbin/cgi-bin/wgetz?[trembl-id:'DCRAD23n_r]+-e+^^ 4/3/2003 
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bioSCOUT 



i Projects I [::- Analyses | j-:?Adr^ ^Mj^fel | , futbrial 



ii|;||eature^^^^ 



Alignment: 0964rad23.sid4 - pdb|lF4I|lF4I-A 

BLASTP - alignment of 0964rad23,sid4 against pdb|lF4I|lF4I-A 

uv excision repair protein protein rad23 homolog afragment: c-terminal uba domain;Mutant 

• This hit is scoring at : 3e-05 (expectation value) 

• Alignment length (overlap) : 45 

• Identities : 46 % 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

• Database searched : nrdb 

Q: 324 EEQEAIGRLESMGFDE^RVIEAFLACDRNEELAANYLLEHAGEED 368 

:E:EAI RL:::GF:.: VI : A: . AC : : NE . LAAN : LL . . ::: 
H: 1 QEKEAIERLKALGFEESLVIQAYFACEKNENLAANFLLSQNFDDE 45 



Legend of Alignment 



: positive score 

score between -2 and 0 



Please report problems and feedback conceming bioSCOUT through the su pport interface. 
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1 p. Tutorial 








Features 



Alignment: 0964rad23.sid4 - pfam|hmm|ubiquitin 



HMMPFAM - alignment of 0964rad23.sid4 against pfam|hmm|ubiquitin 

Ubiquitin family 



This hit is scoring at : 91.1 

Scoring matrix : BLOSUM62 (used to infer consensus pattern) 



Q: 1 MKLTVKTLKGT HFEIRVQPNDTIMAVKKNIEEIQGkdsYPWGQQLLI FNGKVL 

MK: VKTL.G. . . : . V: P : DT : : . : K: . IE : :G .P QQ LI : GKVL 

H: 1 mkifVKtldgkgkvtdgstitlevepsdTVlelKekledkeg. . . ippeqQrLiyaGkvL 

-KDESTLEENKVNEDGFLVVMLSKGK 7 8 

:D: :TL.E . : .E. .F. : : : : : 
reDdttLaeYniqegsftlhlvlrlr 83 



Legend of Alignment 

: positive score 

score between -2 and 0 

Please report problems and feedback conceming bioSCOUT through the sup port interface. 



http://eshpc02.es.dupont.com:8101/cgi-bin/bioSCOUT/WWW/aliview?file==/LION/data/res... 4/3/2003 
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Protein families database of alignments and HMMs 



ubiquitin 




Figure 1: ld3z 
Hydrolase 

Ubiquitin nmr structure 



Accession number: PF00240 



Ubiquitin family 



c 



This family contains a number of ubiquitin-lil<e proteins: SUI^O (smt3 homologue) 
(see ULP1 YEAST ), NeddS (see NED8_MQUSE), Elongin B (see Q15370 ), Rubl 
(see Q9SHE7). 

IE«! This family forms structural complexes with other Pfam families, to view 
them click here 



INTERPRO description (entry IPR0Q0626 ) 

Ubiquitin [ MEDLINE:91274342 L PUB00000768, PUB00000768 is a protein of 
seventy six amino acid residues, found in alleukaryotic cells and whose sequence 
is extremely well conserved from protozoan to vertebrates. It Is widely known as 
a post-translational tag used to signal a protein's hydrolytic destruction. Other 
functions for ubiquitin, depend on its differential internal isopeptide linkages. In 
addition, several ubiquitin-like proteins have been discovered from genome- 
sequencing efforts, other structural studies, and genetic screens. These new data 
show that proteins with the ubiquitin domain are adaptable, transposable genetic 
elements, which have been appended to other genes and utilized for many 
different cellular functions, depending on the ubiquitin-like protein's identity, 
subcellular location, and method of covalent attachment. The post-translational 
ligation of proteins to members of the ubiquitin superfamily can signal many 
different fates for the target protein [Larsen and Wang (2002) J. Proteome Res. 
1, 411-419,] 



Ubiquitin is a globular protein, the last four C-terminal residues (Leu-Arg-Gly-Gly) 
extending from the compact structure to form a 'tail' important for its function. 
The latter is mediated by the covalent conjugation of ubiquitin to target proteins, 
by an isopeptide linkage between the C-terminal glycine and the epsilon amino 
group of lysine residues in the target proteins. 



In most species, there are many genes coding for ubiquitin. However they can be 
classified into two classes. The first class produces polyubiquitin molecules 
consisting of exact head to tail repeats of ubiquitin. The number of repeats is 
variable. In the majority of polyubiquitin precursors, there is a final amino-acid 
after the last repeat. The second class of genes produces precursor proteins 
consisting of a single copy of ubiquitin fused to a C-terminal extension protein 
(CEP). There are two types of CEP proteins and both seem to be ribosomal 
proteins. There are a number of proteins which are evolutionary related to 
ubiquitin, including ubiquitin-like proteins from baculovi ruses : mammalian 
proteins GDX, FAU and RAD23-related proteins; human spliceosome associated 
protein 114, proteins BAT3 and CKAPl/TFCB, and ubiquitin-like proteins SMT3A, 
SMT3C and SMT3B; yeast proteins RAD23, DK2 and SMT3; and C. eleqans SMT3 
and ubl-1 proteins. 



For additional annotation, see the PROSITE document pdoc00271 [ Expasy I SRS-UK | SRS-USA ] 



I 1 

http://www.sanger.ac.uk/cgi-bin/Pfain/getacc7PF00240 
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Alignment 



Domain organisati n 



(? Seed (95) O Full (1322) 



Format 



Coloured alignment 



Get alignment 



<• Seed (95) O Full (1322) 

As a Graphic As a Tree 

Zoom jO.5 i pixels/aa. □ Bootstrap tree 



Further alignment options here 

Help relating to Pfam alignments here 



To find out about the NIFAS tree-viewer, click here 



Species Distribution 



Phylogenetic tree 



View alignments & domain organisation by species 
Tree depth :[lhov!^^ 



^ Seed (95) O Full (1322) 



The trees were generated using Quicktree 
To find out more about ATV phylogenetic tree-viewer click here 



Database References 


PDB 

You can find out how to set up Rasmol here 


|la5r;20;95; ^ ^g^jjff^^gj^^ ^^^{^^^^^ ^ 
j^^^j^^^' ^^^^^^^^^ ^ 


PRINTS 


PR00348 


PROSITE 


PDOC00271 (ExDasvlSRS-UK|SRS-USA] 


HOMSTRAD 


UBQ 


PFAMB 


PB013905 PB018771 PB024244 PB038815 PB039385 PB047048 PB0732 
PB074986 PB075222 PB075546 PB076014 


SYSTERS 


ubiquitin 


PANDIT 


ubiquitin 



Literature References 



Pfam specific information 



Crystal structure of the 
human ubiquitin-like protein 
NEDD8 and interactions with 
ubiquitin pathway enzymes. 
Whitby FG, Xia G, Pickart CM, Hill 
CP* 

J Biol Chem 1998; 273:34983- 
34991. 



Author of entry 



Type definition 



Alignment method of seed 



Source of seed members 



Finn RD, Griffiths- Jones SR 



Domain 



Clustalw 



Prosite 



Structure determination of 
the small ubiqultln-related 
modifier SUMQ-1. 
Bayer P, Arndt A, Metzger S, 
Mahajan R, Melchior F, Jaenicke 



HMMER build information 




Pfam Is [Download HMM] 


Pfam fS [Download HMM] 


Gathering cutoff 


25.5 25.5; 


11.2 11.2 



http://www.sanger.ac.uk/cgi-bin/Pfam/getacc7PF00240 
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Becker J* 

J Mol Biol 1998;280:275-286. 



3. 

Structure of tetraubiquitin 
shows how multiubiquitin 
chains can be formed. 
Cook W, Jeffrey LC, Kasperek E, 
Pickart CM; 

J Mol Biol i994;236:601-609. 



Structure of ubiquitin refined 
at 1 .8 A resolution. 
Vijay-Kumar S, Bugg CE, Cook WJ; 
J Mol Biol 1987;194:531-544. 



Trusted cutoff 


26.2 26.1; 


11.2 11.2 


Noise cutoff 


25.2 25.2; 


11.1 11.1 


Build method of HI^IM 


hmmbuild -F HMMJs SEED 


hmmbuild -f -F HMMJs SEED 




hmmcalibrate -seed 0 HMMJs 


hmmcalibrate -seed 0 HMM_fs 



For help on making stable links to this page dicl< here 
Comments or questions on the site? Send a mail to pfam(S)sanger,ac.uk 



http://www.sanger.ac.uk/cgi-bin/Pfani/getacc7PF00240 
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Alignment: 0964rad23.sid4 - pfam|hmm|UBA 




bioSCOUT 



HMMPFAM - alignment of 0964rad23.sid4 against pfam|hmm|UBA 

UBA/TS-N domain 

• This hit is scoring at : 5 1.0 

• Scoring matrix : BLOSUM62 (used to infer consensus pattern) 

Q: 144 NVDTIINQLMEMGGGSWDKDKVQRALRAAYNN—PERAVEYLYSG 186 

:. :.QL.EMGGG :.:::.: :ALRA. NN ERAVE:L.. 
H : 1 ideekvkqLreMGGGvf sreeakkALratnnnsDverAvewLleh 45 



HMMPFAM - alignment of 0964rad23.sid4 against pfam|hmm|UBA 

UBA/TS-N domain 

• This hit is scoring at : 44.7 

• Scoring matrix : BLOSUM62 (used to infer consensus pattem) 

Q: 324 EEQEAIGRLESMG FDRARVIEAFLACDRN— EELAANYLLEH 363 

::E.: :L..MG F.R... :A. A.:.N EA..:LLEH 
H: 1 ideekvkqLreMGGGvf SreeakkALratnnnsDverAvewLleh 4 5 



Legend of Alignment 

: positive score 

score between -2 and 0 



Please report problems and feedback concerning bioSCOUT through the support interface. 
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Protein families database of alignments and HMMs 



UBA 




Figure 1: lefu 

C mplex (two elongation factors) 

Elongation factor complex ef-tu/ef-ts 
from escherichia coli 



Accession number: PF00627 
UBA/TS-N domain C 



This small domain is composed of three alpha helices. This family includes the 
previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated 
domain) is a novel sequence motif found in several proteins having connections 
to ubiquitin and the ubiquitination pathway. The structure of the UBA domain 
consists of a compact three helix bundle [1]. This domain is found at the N 
terminus of EF-TS hence the name TS-N. The structure of EF-TS is known and 
this domain is implicated in its interaction with EF-TU [2]. The domain has been 
found in non EF-TS proteins such as alpha-NAC P70670 and MJ0280 
Y280 METJA [1]. 

This family forms structural complexes with other Pfam families, to view 
them click here 



INTERPRO description (entry 1PR000449) 

UBA domains are a commonly occurring sequence motif of approximately 45 
amino acid residues that are found in diverse proteins involved in the 
ubiquitin/proteasome pathway, DNA excision-repair, and cell signaling via protein 
kinases f MEDLINE:97025177 1. The human homologue of yeast Rad23A is one 
example of a nucleotide excision-repair protein that contains both an internal and 
a C-terminal UBA domain. 

The solution structure of human Rad23A UBA(2) showed that the domain forms a 
compact three-helix bundle r MEDLINE:99Q61330 1. Comparison of the structures 
of UBA(l) and UBA(2) reveals that both form very similar folds and have a 
conserved large hydrophobic surface patch which may be a common protein- 
interacting surface present in diverse UBA domains. Evidence that ubiquitin binds 
to UBA domains leads to the prediction that the hydrophobic surface patch of 
UBA domains interacts with the hydrophobic surface on the five-stranded P - 
sheet of ubiquitin r MEDLINE:22075341 1. 



For additional annotation, see the PRQSITE document PDOC50015 [ Expasv I SRS-UK j SRS-USA ] 



Alignment 



Domain organisation 



<5 Seed (162) O Full (473) 



Format Coloured alignment 



Get alignment: 



Further alignment options here 

Help relating to Pfam alignments here 



Seed (162) O Full (473) O Context (51) 

As a Graphic As a Tree 

Zoom |0^5^ pixels/aa. □ Bootstrap tree 
U^A^: Graphic \ ry^^^^^^- | 



To find out about the NIFAS tree-viewer, click here 



http://www.sanger.ac.uk/cgi-bin/Pfani/getacc7PF00627 
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Species Distribution 


Phylogenetic tree 


twzw. vicw anynrncncs ot uomain organisauun uy spccics 


<• Seed (152) O Full (473) 


Tree depth :|Show all levels gj 


'S Download treetfs jft Alp/S^pplet 


The trees were Generated using Quicktree 
To find out more about ATV phylogenetic tree-viewer click here 
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